المستخلص: |
Sentiment analysis is the process of determining a predefined sentiment from text written in a natural language with respect to the entity to which it is referring. A number of lexical resources are available to facilitate this task in English. One such resource is the Senti Word Net, which assigns sentiment scores to words found in the English Word Net. In this paper, we present an Arabic sentiment lexicon that assigns sentiment scores to the words found in the Arabic Word Net. Starting from a small seed list of positive and negative words, we used semi-supervised learning to propagate the scores in the Arabic Word Net by exploiting the synset relations. Our algorithm assigned a positive sentiment score to more than 800, a negative score to more than 600 and a neutral score to more than 6000 words in the Arabic Word Net. The lexicon was evaluated by incorporating it into a machine learning-based classifier. The experiments were conducted on several Arabic sentiment corpora, and we were able to achieve a 96% classification accuracy.
|