TY - GEN
T1 - Are emoticons good enough to train emotion classifiers of Arabic tweets?
AU - Hussien, Wegdan A.
AU - Tashtoush, Yahya M.
AU - Al-Ayyoub, Mahmoud
AU - Al-Kabi, Mohammed N.
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/8/23
Y1 - 2016/8/23
N2 - Nowadays, the automatic detection of emotions is employed by many applications across different fields like security informatics, e-learning, humor detection, targeted advertising, etc. Many of these applications focus on social media. In this study, we address the problem of emotion detection in Arabic tweets. We focus on the supervised approach for this problem where a classifier is trained on an already labeled dataset. Typically, such a training set is manually annotated, which is expensive and time consuming. We propose to use an automatic approach to annotate the training data based on using emojis, which are a new generation of emoticons. We show that such an approach produces classifiers that are more accurate than the ones trained on a manually annotated dataset. To achieve our goal, a dataset of emotional Arabic tweets is constructed, where the emotion classes under consideration are: anger, disgust, joy and sadness. Moreover, we consider two classifiers: Support Vector Machine (SVM) and Multinomial Naive Bayes (MNB). The results of the tests show that the automatic labeling approaches using SVM and MNB outperform manual labeling approaches.
AB - Nowadays, the automatic detection of emotions is employed by many applications across different fields like security informatics, e-learning, humor detection, targeted advertising, etc. Many of these applications focus on social media. In this study, we address the problem of emotion detection in Arabic tweets. We focus on the supervised approach for this problem where a classifier is trained on an already labeled dataset. Typically, such a training set is manually annotated, which is expensive and time consuming. We propose to use an automatic approach to annotate the training data based on using emojis, which are a new generation of emoticons. We show that such an approach produces classifiers that are more accurate than the ones trained on a manually annotated dataset. To achieve our goal, a dataset of emotional Arabic tweets is constructed, where the emotion classes under consideration are: anger, disgust, joy and sadness. Moreover, we consider two classifiers: Support Vector Machine (SVM) and Multinomial Naive Bayes (MNB). The results of the tests show that the automatic labeling approaches using SVM and MNB outperform manual labeling approaches.
KW - Arabic emotion analysis
KW - Arabic emotion annotation
KW - emojis
KW - sentiment analysis
UR - https://www.scopus.com/pages/publications/84987642658
U2 - 10.1109/CSIT.2016.7549459
DO - 10.1109/CSIT.2016.7549459
M3 - Conference contribution
AN - SCOPUS:84987642658
T3 - Proceedings - CSIT 2016: 2016 7th International Conference on Computer Science and Information Technology
BT - Proceedings - CSIT 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th International Conference on Computer Science and Information Technology, CSIT 2016
Y2 - 13 July 2016 through 14 July 2016
ER -