LEADER |
01524nam a22002177a 4500 |
001 |
0011424 |
041 |
|
|
|a eng
|
044 |
|
|
|b المغرب
|
100 |
|
|
|9 38145
|a Lee, Mark
|e Author
|
245 |
|
|
|a Automatic Multi-Dialect Analysis of Arabic
|
260 |
|
|
|b مؤسسة العرفان للإستشارات التربوية والتطوير المهني
|c 2014
|
300 |
|
|
|a 95 - 108
|
336 |
|
|
|a بحوث ومقالات
|b Article
|
520 |
|
|
|b In this paper we address the problem of the analysis of multi-dialect Arabic morphology. Our method involves the synthesis of two methods. The first method is linguistic, using an adopted Modern Standard Arabic (MSA) Morphology Analyser to first deal with dialect prefixes and suffixes and then analyse remaining word fragment. This method improves accuracy of dialect words by 69%. The second method involves segmenting the word and using ‘the web as corpus' to estimate the frequency of different segment combinations which then are used to guess the correct base form. The overall synthesis is shown to have 94% accuracy on a corpus of Arabic dialects.
|
653 |
|
|
|a اللغة العربية
|a اللهجات العربية
|a الكمبيوتر
|a التحليل اللغوي
|
700 |
|
|
|9 15377
|a Al Meman, Khalid
|e Advisor
|
773 |
|
|
|4 علم اللغة
|6 Linguistics
|c 003
|l 990
|m مج16, ملحق
|o 1361
|s مجلة التواصل اللساني
|t Journal of linguistic communication
|v 016
|x 0851-6774
|
856 |
|
|
|u 1361-016-990-003.pdf
|
930 |
|
|
|d y
|p y
|q y
|
995 |
|
|
|a AraBase
|
999 |
|
|
|c 597022
|d 597022
|