Enhancing Arabic Stemming Process Using Resources And Benchmarking Tools

Jaafar, Younes; Namly, Driss; Bouzoubaa, Karim; Yousfi, Abdellah

Enhancing Arabic Stemming Process Using Resources And Benchmarking Tools

المصدر:	مجلة جامعة الملك سعود - علوم الحاسب والمعلومات
الناشر:	جامعة الملك سعود
المؤلف الرئيسي:	Jaafar, Younes (Author)
مؤلفين آخرين:	Namly, Driss (Co-Author) , Bouzoubaa, Karim (Co-Author) , Yousfi, Abdellah (Co-Author)
المجلد/العدد:	مج29, ع2
محكمة:	نعم
الدولة:	السعودية
التاريخ الميلادي:	2017
الصفحات:	164 - 170
DOI:	10.33948/0584-029-002-004
ISSN:	1319-1578
رقم MD:	974090
نوع المحتوى:	بحوث ومقالات
اللغة:	الإنجليزية
قواعد المعلومات:	science
مواضيع:	اللغة العربية \| التراكيب اللغوية \| اللسانيات الحاسوبية
كلمات المؤلف المفتاحية:	Arabic Stemming \| Evaluation \| Benchmark \| Evaluation Corpus
رابط المحتوى:	PDF (صورة)

LEADER	02414nam a22002657a 4500
001	1716921
024		\|3 10.33948/0584-029-002-004
041		\|a eng
044		\|b السعودية
100		\|9 525331 \|a Jaafar, Younes \|e Author
245		\|a Enhancing Arabic Stemming Process Using Resources And Benchmarking Tools
260		\|b جامعة الملك سعود \|c 2017
300		\|a 164 - 170
336		\|a بحوث ومقالات \|b Article
520		\|b Many approaches and solutions have been proposed for developing Arabic light stemmers. These stemmers are often used in the context of application-oriented projects, especially when it comes to developing information retrieval (IR) systems. However, Arabic light stemming, as the process of stripping off a set of prefixes and/or suffixes, is a blinded task suffering from problems such as incorrect removal, vocalization ambiguity, single solution, etc. Moreover, each researcher claims that his/her stemmer reached a level of strength and accuracy quite high. However, in most cases, these stemmers are black boxes and it is not possible to access neither their source codes to verify their validity, nor the evaluation corpora that were used to claim such accuracy. Since these stemmers are very important for researchers, their comparison and evaluation is then essential to facilitate the choice of the stemmer to use in a given project. In this paper, we propose a new Arabic stemmer that gives solutions to the above mentioned drawbacks. In addition, we propose an automatic approach for the evaluation and comparison of Arabic stemmers that takes into account metrics related to the accuracy of results as well as the execution time of stemmers.
653		\|a اللغة العربية \|a التراكيب اللغوية \|a اللسانيات الحاسوبية
692		\|b Arabic Stemming \|b Evaluation \|b Benchmark \|b Evaluation Corpus
700		\|9 525332 \|a Namly, Driss \|e Co-Author
700		\|9 24403 \|a Bouzoubaa, Karim \|e Co-Author
700		\|9 525334 \|a Yousfi, Abdellah \|e Co-Author
773		\|c 004 \|e Journal of King Saud University (Computer and Information Sciences) \|f Maǧalaẗ ǧamʼaẗ al-malīk Saud : ùlm al-ḥasib wa al-maʼlumat \|l 002 \|m مج29, ع2 \|o 0584 \|s مجلة جامعة الملك سعود - علوم الحاسب والمعلومات \|v 029 \|x 1319-1578
856		\|u 0584-029-002-004.pdf
930		\|d y \|p y
995		\|a science
999		\|c 974090 \|d 974090

عناصر مشابهة

Toward An Enhanced Arabic Text Classification Using Cosine Similarity And Latent Semantic Indexing
بواسطة: Al-Anzi, Fawaz S. منشور: (2017)
Arabic Word Processing And Morphology Induction Through Adaptive Memory Self Organisation Strategies
بواسطة: Marzi, Claudia منشور: (2017)
Towards A Standard Part Of Speech Tagset For The Arabic Language
بواسطة: Zeroual, Imad منشور: (2017)
Rational Kernels for Arabic Root Extraction and Text Classification
بواسطة: Nehar, Attia منشور: (2016)
Arabic Natural Language Processing: Models, Systems And Applications
بواسطة: Pirrelli, Vito منشور: (2017)

Enhancing Arabic Stemming Process Using Resources And Benchmarking Tools

عناصر مشابهة

دليل المستخدم

دليل الفيديو