
Emotion analysis of Arabic articles and its impact on identifying the author's gender

  • Kholoud Alsmearat
  • Mohammed Shehab
  • Mahmoud Al-Ayyoub
  • Riyad Al-Shalabi
  • Ghassan Kanaan
  • Jordan University of Science and Technology
  • Amman Arab University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

The Gender Identification (GI) problem is concerned with determining the gender of the author of a given text based on its contents. The GI problem is one of the authorship profiling problems, which have a wide range of applications in various fields such as marketing and security. Due to its importance, extensive research efforts have been invested in the GI problem for different languages. Unfortunately, the same cannot be said about the Arabic language despite its strategic importance and widespread use. In this work, we explore the GI problem for Arabic text as a supervised learning problem. Specifically, we consider and compare two approaches for feature extraction. The first one is the Bag-Of-Words (BOW) approach while the second one is based on computing features related to sentiments and emotions. One goal of this work is to test the validity of the common stereotype that female authors tend to write in a more emotional way than male authors. Our results show that there is no conclusive evidence that this is true for our dataset.
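
To make the contrast between the two feature-extraction approaches concrete, the following is a minimal sketch, not the authors' implementation. The lexicon entries, the emotion categories, the whitespace tokenizer, and the function names `bow_features` and `emotion_features` are all illustrative assumptions; the abstract does not specify the lexicons, preprocessing, or classifiers used.

```python
from collections import Counter

# Hypothetical toy emotion lexicon (an assumption for illustration only;
# it is not the lexicon used in the paper).
EMOTION_LEXICON = {
    "سعيد": "joy",      # "happy"
    "فرح": "joy",       # "joy"
    "حزين": "sadness",  # "sad"
    "غاضب": "anger",    # "angry"
}
EMOTIONS = ("joy", "sadness", "anger")


def tokenize(text):
    """Split on whitespace. Real Arabic text would likely also need
    normalization and stemming, which are omitted here."""
    return text.split()


def bow_features(text, vocabulary):
    """Bag-of-words: one raw count per vocabulary word."""
    counts = Counter(tokenize(text))
    return [counts[word] for word in vocabulary]


def emotion_features(text):
    """Lexicon-based: fraction of tokens tagged with each emotion."""
    tokens = tokenize(text)
    counts = Counter(EMOTION_LEXICON[t] for t in tokens if t in EMOTION_LEXICON)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[emotion] / total for emotion in EMOTIONS]
```

Either feature vector could then be passed, together with the author's gender as the label, to any supervised classifier. The BOW vector grows with the vocabulary, whereas the lexicon-based vector has one dimension per emotion category.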

Original language: English
Title of host publication: 2015 IEEE/ACS 12th International Conference of Computer Systems and Applications, AICCSA 2015
Publisher: IEEE Computer Society
ISBN (Electronic): 9781509004782
State: Published - 7 Jul 2016
Externally published: Yes
Event: 12th IEEE/ACS International Conference of Computer Systems and Applications, AICCSA 2015 - Marrakech, Morocco
Duration: 17 Nov 2015 → 20 Nov 2015

Publication series

Name: Proceedings of IEEE/ACS International Conference on Computer Systems and Applications, AICCSA
Volume: 2016-July
ISSN (Print): 2161-5322
ISSN (Electronic): 2161-5330

Conference

Conference: 12th IEEE/ACS International Conference of Computer Systems and Applications, AICCSA 2015
Country/Territory: Morocco
City: Marrakech
Period: 17/11/15 → 20/11/15

Keywords

  • Arabic text analysis
  • bag-of-words
  • emotions lexicons
  • gender identification
  • sentiments lexicons
