LEADER |
01560nam a22002177a 4500 |
001 |
0011327 |
041 |
|
|
|a eng
|
044 |
|
|
|b Morocco
|
100 |
|
|
|9 46913
|a Sari, Toufik
|e Author
|
245 |
|
|
|a Correcting Arabic OCR Outputs by Morphological Analysis of Words
|
260 |
|
|
|b مؤسسة العرفان للإستشارات التربوية والتطوير المهني
|c 2013
|
300 |
|
|
|a 171 - 184
|
336 |
|
|
|a Research and articles
|b Article
|
520 |
|
|
|b In this paper, we propose a context-based method for correcting Arabic words produced by OCR systems. The technique operates as a post-processor and is intended to be universal. It corrects substitution and rejection errors. The properties of the Arabic language are very useful in morpho-lexical analysis and are therefore heavily exploited in developing the method. Substitution errors, the errors most frequently committed by OCR systems, are encoded as production rules for use by a rule-based system that corrects Arabic words. The first version of the method operates only at the morpho-lexical level; its extension to other levels of language analysis is left for future work.
|
653 |
|
|
|a Arabic language
|a Morphology
|
700 |
|
|
|9 47282
|a Sellami, Mokhtar
|e Advisor
|
773 |
|
|
|4 علم اللغة
|6 Linguistics
|c 002
|l 001,002
|m Vol. 15, No. 1,2
|o 1361
|s مجلة التواصل اللساني
|t Journal of linguistic communication
|v 015
|x 0851-6774
|
856 |
|
|
|u 1361-015-001,002-002.pdf
|
930 |
|
|
|d y
|p y
|q y
|
995 |
|
|
|a AraBase
|
999 |
|
|
|c 596869
|d 596869
|