
Arabic sentiment analysis: Lexicon-based and corpus-based

  • Jordan University of Science and Technology

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

267 Scopus citations

Abstract

The emergence of Web 2.0 technology generated a massive amount of raw data by enabling Internet users to post their opinions, reviews, and comments on the web. Processing this raw data to extract useful information can be a very challenging task. One important kind of information that can be automatically extracted from users' posts and comments is their opinions on different issues, events, services, products, etc. This problem, Sentiment Analysis (SA), has been well studied for the English language, and two main approaches have been devised: corpus-based and lexicon-based. This paper addresses both approaches to SA for the Arabic language. Since there is a limited number of publicly available Arabic datasets and Arabic lexicons for SA, this paper starts by building a manually annotated dataset and then takes the reader through the detailed steps of building the lexicon. Experiments are conducted throughout the different stages of this process to observe the improvements gained in the accuracy of the system and to compare them to the corpus-based approach.

Original language: English
Title of host publication: 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies, AEECT 2013
Publisher: IEEE Computer Society
ISBN (Print): 9781479923038
State: Published - 2013
Externally published: Yes
Event: 2013 2nd IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies, AEECT 2013 - Amman, Jordan
Duration: 3 Dec 2013 to 5 Dec 2013

Publication series

Name: 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies, AEECT 2013

Conference

Conference: 2013 2nd IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies, AEECT 2013
Country/Territory: Jordan
City: Amman
Period: 3/12/13 to 5/12/13

Keywords

  • Arabic language
  • Corpus-based
  • Lexicon-based
  • Opinion mining
  • Sentiment analysis
