Skip to main navigation Skip to search Skip to main content

Towards improving the lexicon-based approach for arabic sentiment analysis

  • Nawaf A. Abdulla
  • , Nizar A. Ahmed
  • , Mohammed A. Shehab
  • , Mahmoud Al-Ayyoub
  • , Mohammed N. Al-Kabi
  • , Saleh Al-rifai
  • Jordan University of Science and Technology
  • Zarqa University

Research output: Contribution to journalArticlepeer-review

84 Scopus citations

Abstract

The emergence of the Web 2.0 technology generated a massive amount of raw data by enabling Internet users to post their opinions on the web. Processing this raw data to extract useful information can be a very challenging task. An example of important information that can be automatically extracted from the users' posts is their opinions on different issues. This problem of Sentiment Analysis (SA) has been studied well on the English language and two main approaches have been devised: corpus-based and lexicon-based. This work focuses on the later approach due to its various challenges and high potential. The discussions in this paper take the reader through the detailed steps of building the main two components of the lexicon-based SA approach: the lexicon and the SA tool. The experiments show that significant efforts are still needed to reach a satisfactory level of accuracy for the lexicon-based Arabic SA. Nonetheless, they do provide an interesting guide for the researchers in their on-going efforts to improve lexicon-based SA.

Original languageEnglish
Pages (from-to)55-71
Number of pages17
JournalInternational Journal of Information Technology and Web Engineering
Volume9
Issue number3
DOIs
StatePublished - 1 Jul 2014
Externally publishedYes

Keywords

  • Arabic text pre-processing
  • Dialectal Arabic
  • Intensification
  • Sentiment lexicon construction
  • Stemming
  • Switch negation
  • Twitter
  • Yahoo!-Maktoob

Fingerprint

Dive into the research topics of 'Towards improving the lexicon-based approach for arabic sentiment analysis'. Together they form a unique fingerprint.

Cite this