LEADER |
01536nam a22002057a 4500 |
001 |
0012983 |
041 |
|
|
|a eng
|
044 |
|
|
|b Morocco
|
100 |
|
|
|9 23555
|a Benajibe, Yassine
|e Author
|
245 |
|
|
|a Towards Measure for Arabic Corpora Quality
|
260 |
|
|
|b معهد الدراسات والأبحاث للتعريب
|c 2007
|g June
|
300 |
|
|
|a 213 - 221
|
336 |
|
|
|a Conference papers
|b Article
|
520 |
|
|
|b In this paper we present a statistical measure that is used, for the first time, to evaluate the quality of Arabic corpora. The measure is based entirely on statistical data and is language-independent. However, the values obtained in experiments could differ considerably for corpora written in different languages. Our experiments were conducted on Arabic corpora. We chose four corpora of different types in order to determine which corpus characteristics our quality measure reflects. The preliminary results show that the measure is significantly correlated with the writing style and the nature of the text.
|
653 |
|
|
|a Conferences and seminars
|a Research abstracts
|a Arabic language
|a Grammar and morphology
|
773 |
|
|
|c 005
|d Rabat
|i منشورات معهد الدراسات والأبحاث للتعريب جامعة محمد الخامس
|l 000
|o 6904
|s الندوة الدولية: المعالجة الآلية للغة العربية CITALA'07
|v 000
|
856 |
|
|
|u 6904-000-000-005.pdf
|
930 |
|
|
|d y
|p y
|
995 |
|
|
|a AraBase
|
999 |
|
|
|c 600096
|d 600096
|