TY - GEN
T1 - A bilingual emotion recognition system using deep learning neural networks
AU - Abo Absa, Ahmed H.
AU - Deriche, M.
AU - Mohandes, M.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12/7
Y1 - 2018/12/7
N2 - Speech emotion recognition continues to be a challenging problem and a flourishing area of active research especially under mixed language scenarios. In this paper, we show that emotion and language type are dependent and that improved emotion recognition accuracy by taking into consideration the nature of different languages. Recognizing emotion across such language diversity can be challenging and may result in a very large number of classes (product of number of emotion types and number of languages). Here, we propose a cross language emotion recognition system using Deep Learning Neural Networks (DNN). We show that the system is able to recognize accurately the six common types of emotion: Neutral, Happy, Angry, Sad, Fear and Bored. While in our experiments we considered two languages (from Berlin and Polish databases), the work can be extended to a larger pool of languages. The overall recognition accuracy obtained with the proposed technique reached above 93%. It is worth noting that the proposed algorithm outperforms by far the recognition accuracy of systems not considering the nature of specific languages (barely around 50%).
AB - Speech emotion recognition continues to be a challenging problem and a flourishing area of active research especially under mixed language scenarios. In this paper, we show that emotion and language type are dependent and that improved emotion recognition accuracy by taking into consideration the nature of different languages. Recognizing emotion across such language diversity can be challenging and may result in a very large number of classes (product of number of emotion types and number of languages). Here, we propose a cross language emotion recognition system using Deep Learning Neural Networks (DNN). We show that the system is able to recognize accurately the six common types of emotion: Neutral, Happy, Angry, Sad, Fear and Bored. While in our experiments we considered two languages (from Berlin and Polish databases), the work can be extended to a larger pool of languages. The overall recognition accuracy obtained with the proposed technique reached above 93%. It is worth noting that the proposed algorithm outperforms by far the recognition accuracy of systems not considering the nature of specific languages (barely around 50%).
KW - deep learning
KW - language recognition
KW - speech emotion recognition
UR - https://www.scopus.com/pages/publications/85060638821
U2 - 10.1109/SSD.2018.8570407
DO - 10.1109/SSD.2018.8570407
M3 - Conference contribution
AN - SCOPUS:85060638821
T3 - 2018 15th International Multi-Conference on Systems, Signals and Devices, SSD 2018
SP - 1241
EP - 1245
BT - 2018 15th International Multi-Conference on Systems, Signals and Devices, SSD 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Multi-Conference on Systems, Signals and Devices, SSD 2018
Y2 - 19 March 2018 through 22 March 2018
ER -