TY - GEN
T1 - Hybrid Deep Learning Algorithm for Real-Time Emotion Detection from Facial Expressions
AU - Praveen, R. V.S.
AU - Shrivastava, Anurag
AU - Al Said, Nidal
AU - Habelalmateen, Mohammed I.
AU - Yadav, Kanchan
AU - Haleem, Ali S.
AU - Alhayaly, Mostafa Abdulsalam
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Effective and real-time facial expression detection is becoming increasingly crucial in affective computing, intelligent user interfaces, and mental health screening to comprehend human emotions. Lighting, face angle, and occlusion changes typically challenge standard emotion recognition algorithms in real-life, ever-changing scenarios. This paper introduces EmotionSenseNet, a novel Emotion Sensing Network with Spatiotemporal and Attention Fusion, a deep learning network that combines spatial, temporal, and attentional data, improving emotion identification and targeting these issues. The design uses a lightweight CNN based on MobileNetV2 to extract spatial information and an LSTM network to define facial motion temporal correlations. The approach is simplified with a focus module highlighting essential face traits and periods. Face landmark alignment preprocessing provides input normalization across face pictures. The suggested technique has minimal latency and 91.3% accuracy on benchmark datasets like FER-2013 and CK+, making it suited for real-time edge device applications. Despite practical constraints, EmotionSenseNet can identify emotions reliably and scalable. Therefore, future research will focus on establishing more extensive emotional classifications and energy-efficient embedded system deployment techniques.
AB - Effective and real-time facial expression detection is becoming increasingly crucial in affective computing, intelligent user interfaces, and mental health screening to comprehend human emotions. Lighting, face angle, and occlusion changes typically challenge standard emotion recognition algorithms in real-life, ever-changing scenarios. This paper introduces EmotionSenseNet, a novel Emotion Sensing Network with Spatiotemporal and Attention Fusion, a deep learning network that combines spatial, temporal, and attentional data, improving emotion identification and targeting these issues. The design uses a lightweight CNN based on MobileNetV2 to extract spatial information and an LSTM network to define facial motion temporal correlations. The approach is simplified with a focus module highlighting essential face traits and periods. Face landmark alignment preprocessing provides input normalization across face pictures. The suggested technique has minimal latency and 91.3% accuracy on benchmark datasets like FER-2013 and CK+, making it suited for real-time edge device applications. Despite practical constraints, EmotionSenseNet can identify emotions reliably and scalable. Therefore, future research will focus on establishing more extensive emotional classifications and energy-efficient embedded system deployment techniques.
KW - EmotionSenseNet
KW - Facial Emotion Recognition
KW - Hybrid Deep Learning
KW - Real-Time Emotion Detection
KW - Spatiotemporal Feature Fusion
UR - https://www.scopus.com/pages/publications/105031584331
U2 - 10.1109/ICCR67387.2025.11292359
DO - 10.1109/ICCR67387.2025.11292359
M3 - Conference contribution
AN - SCOPUS:105031584331
T3 - ICCR 2025 - 3rd International Conference on Cyber Resilience
BT - ICCR 2025 - 3rd International Conference on Cyber Resilience
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 3rd International Conference on Cyber Resilience, ICCR 2025
Y2 - 3 July 2025 through 4 July 2025
ER -