TY - GEN
T1 - Real-Time Hand Tracking and Trajectory Gesture Recognition
AU - Mohd Asaari, Mohd Shahrimie
AU - Yi, Ooi Shin
AU - Mohsen Saleh, Sami Abdula
AU - Ishak, Mohamad Khairi
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - With the advent of modern technologies, traditional input devices such as the mouse, keyboard, and remote control are becoming obsolete due to their lack of flexibility. In today's society, humans primarily communicate with computers using body language or voice commands, which are now widely integrated into most electronic devices. Hand gestures, in particular, are highly effective in human-computer interaction due to their natural expressiveness. However, vision-based hand gesture recognition systems face challenges such as complex backgrounds, illumination variations, and other environmental factors. Additionally, reliably detecting and tracking hands to extract trajectory information from video scenes remains a difficult task due to the diverse appearances of human hands, including variations in hand shapes, skin colors, illuminations, orientations, and scales in color images. Distinguishing between meaningful and meaningless motion trajectories further complicates dynamic hand gesture recognition. To address these challenges, a real-time hand tracking and gesture recognition system is proposed. This project implements real-time hand tracking and landmark estimation using Python, OpenCV, and MediaPipe. Hand trajectory gesture recognition is then achieved using a customized Convolutional Neural Network (CNN) with three convolutional layers, one flatten layer, and two fully connected layers. The network is built and trained using Matlab and trained model is exported to python using scipy.io module for model deployment. The model was trained on the MNIST dataset, which consists of 10 numeric gestures (0-9) with an overall testing accuracy of 97. 21%. When implemented on trajectory-based gestures, the system yielded an accuracy of 87.1%.
AB - With the advent of modern technologies, traditional input devices such as the mouse, keyboard, and remote control are becoming obsolete due to their lack of flexibility. In today's society, humans primarily communicate with computers using body language or voice commands, which are now widely integrated into most electronic devices. Hand gestures, in particular, are highly effective in human-computer interaction due to their natural expressiveness. However, vision-based hand gesture recognition systems face challenges such as complex backgrounds, illumination variations, and other environmental factors. Additionally, reliably detecting and tracking hands to extract trajectory information from video scenes remains a difficult task due to the diverse appearances of human hands, including variations in hand shapes, skin colors, illuminations, orientations, and scales in color images. Distinguishing between meaningful and meaningless motion trajectories further complicates dynamic hand gesture recognition. To address these challenges, a real-time hand tracking and gesture recognition system is proposed. This project implements real-time hand tracking and landmark estimation using Python, OpenCV, and MediaPipe. Hand trajectory gesture recognition is then achieved using a customized Convolutional Neural Network (CNN) with three convolutional layers, one flatten layer, and two fully connected layers. The network is built and trained using Matlab and trained model is exported to python using scipy.io module for model deployment. The model was trained on the MNIST dataset, which consists of 10 numeric gestures (0-9) with an overall testing accuracy of 97. 21%. When implemented on trajectory-based gestures, the system yielded an accuracy of 87.1%.
KW - Convolutional neural network
KW - Gesture recognition
KW - Hand tracking
KW - Human-computer interaction
KW - MediaPipe
UR - https://www.scopus.com/pages/publications/105007538426
U2 - 10.1109/ICCIT63348.2025.10989294
DO - 10.1109/ICCIT63348.2025.10989294
M3 - Conference contribution
AN - SCOPUS:105007538426
T3 - Proceedings of 2025 4th International Conference on Computing and Information Technology, ICCIT 2025
SP - 50
EP - 55
BT - Proceedings of 2025 4th International Conference on Computing and Information Technology, ICCIT 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Computing and Information Technology, ICCIT 2025
Y2 - 13 April 2025 through 14 April 2025
ER -