LEADER |
02096nam a22002537a 4500 |
001 |
1716707 |
024 |
|
|
|3 10.33948/0584-028-002-002
|
041 |
|
|
|a eng
|
044 |
|
|
|b السعودية
|
100 |
|
|
|9 525126
|a Nehar, Attia
|e Author
|
245 |
|
|
|a Rational Kernels for Arabic Root Extraction and Text Classification
|
260 |
|
|
|b جامعة الملك سعود
|c 2016
|
300 |
|
|
|a 157 - 169
|
336 |
|
|
|a بحوث ومقالات
|b Article
|
520 |
|
|
|b In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels. We introduce a new root extraction approach on the basis of the use of Arabic patterns (Pattern Based Stemmer). Transducers are used to model these patterns and root extraction is done without relying on any dictionary. Using transducers for extracting roots, documents are transformed into finite state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification. Root extraction experiments are conducted on three word collections and yield 75.6% of accuracy. Classification experiments are done on the Saudi Press Agency dataset and N-gram kernels are tested with different values of N. Accuracy and F1 report 90.79% and 62.93% respectively. These results show that our approach, when compared with other approaches, is promising specially in terms of accuracy and F1.
|
653 |
|
|
|a اللسانيات الحاسوبية
|a اللغة العربية
|a التراكيب اللغوية
|
692 |
|
|
|b N-Gram
|b Arabic
|b Classification
|b Rational Kernels
|b Automata
|b Transducers
|
700 |
|
|
|9 525127
|a Ziadi, Djelloul
|e Co-Author
|
700 |
|
|
|9 525128
|a Cherroun, Hadda
|e Co-Author
|
773 |
|
|
|c 002
|e Journal of King Saud University (Computer and Information Sciences)
|f Maǧalaẗ ǧamʼaẗ al-malīk Saud : ùlm al-ḥasib wa al-maʼlumat
|l 002
|m مج28, ع2
|o 0584
|s مجلة جامعة الملك سعود - علوم الحاسب والمعلومات
|v 028
|x 1319-1578
|
856 |
|
|
|u 0584-028-002-002.pdf
|
930 |
|
|
|d y
|p y
|q n
|
995 |
|
|
|a science
|
999 |
|
|
|c 973846
|d 973846
|