LEADER |
01647nam a22002297a 4500 |
001 |
0008550 |
041 |
|
|
|a eng
|
044 |
|
|
|b المغرب
|
100 |
|
|
|9 45616
|a Rozovoskaya, Alla
|e Author
|
245 |
|
|
|a Language Modeling of Arabic Dialects
|
260 |
|
|
|b معهد الدراسات والأبحاث للتعريب
|c 2007
|g يناير
|
300 |
|
|
|a 252 - 262
|
336 |
|
|
|a بحوث المؤتمرات
|b Article
|
500 |
|
|
|a المقال باللغة الانجليزية
|
520 |
|
|
|b This paper describes several approaches to language modeling of Arabic dialects using Modern Standard Arabic (MSA) data. We build a baseline language model on words and experiment with various techniques of data transformation to account for differences between MSA and Colloquial Arabic. Specifically, we describe three methods of data transformation: morphological simplification (stemming), lexical transductions, and syntactic transformations. We compare the performance of each method with that of the baseline language model. While the best performing model remains the one built using only dialectal data, these techniques allow us to obtain an improvement over the baseline MSA model.
|
653 |
|
|
|a النمذجة
|a اللهجات العربية
|a اللغة العربية
|
700 |
|
|
|9 48918
|a Sproat, Richard
|e Co-Author
|
773 |
|
|
|c 012
|l 000
|o 6868
|s وقائع الندوة الدولية : المعالجة الآلية للغة العربية
|v 000
|d الرباط
|i معهد الدراسات والأبحاث للتعريب بالرباط - جامعة محمد الخامس
|
856 |
|
|
|u 6868-000-000-012.pdf
|
930 |
|
|
|d y
|p y
|
995 |
|
|
|a AraBase
|
999 |
|
|
|c 593921
|d 593921
|