Real-time dynamic hand gesture recognition system on edge devices

Keywords: real-time gesture recognition, human-computer interaction, dynamic hand gestures, deep learning, computer vision, edge devices

Abstract

This paper presents a novel real-time dynamic hand gesture recognition system designed for efficient interaction with smart devices and touchless interfaces, with a focus on edge devices. The proposed system integrates Google Mediapipe for hand pose detection, which is lightweight enough to run on mobile devices, with a modified version of the DD-Net architecture, optimized for online classification of gestures using 2D and 3D data. Key innovations include the introduction of an auxiliary classification head to address class imbalance and an attention mechanism to improve the recognition of partially observed gestures. The system is evaluated on the NVGesture and SHREC22 datasets, achieving an accuracy of 0.784 and 0.924, respectively, surpassing previous benchmarks. Experimental results demonstrate the high efficiency of the proposed approach in real-time gesture recognition tasks, particularly on mobile platforms. It is shown that the use of 3D data and an attention mechanism significantly improves recognition accuracy, especially for complex and partially visible gestures. The proposed system can be used in a wide range of applications, including virtual and augmented reality systems, robotics, and touchless interfaces on edge devices

References

1. Iqbal, M.Z., Campbell, A.G. (2021). From luxury to necessity: Progress of touchless interaction technology. Technology in Society. Volume 67, November 2021, 101796. https://doi.org/10.1016/j.techsoc.2021.101796
2. Teslyuk, V., Chornenkyi, V., Tsapiv, V., & Kazymyra, I. (2024). Investigating hand gesture recognition models: 2D CNNs vs. visual transformers. CEUR Workshop Proceedings. Vol. 3688: Proc. of the 8th Int. conf. on computational linguistics and intelligent systems. Vol. III: Intelligent systems workshop. Lviv, Ukraine, Apr. 12-13, 2024, 29–38. CEUR-WS.org.
3. Tsapiv, V., & Kazymyra, I. (2024). Gesture Control in Real-time: a framework for 3D hand pose recognition for smart devices and touchless interfaces. In 1st International Scientific and Practical Conference on Computational Intelligence and Smart Systems, 126–128.
4. Zhang, W., Lin, Z., Cheng, J., Ma, C., Deng, X., & Wang, H. (2020). STA-GCN: two-stream graph convolutional network with spatial–temporal attention for hand gesture recognition. The Visual Computer, 36(10–12), 2433–2444.
5. Do, N.-T., Kim, S.-H., Yang, H.-J., & Lee, G.-S. (2020). Robust hand shape features for Dynamic Hand Gesture Recognition using multi-level feature LSTM. Applied Sciences (Basel, Switzerland), 10(18), 6293.

Abstract views: 23
PDF Downloads: 16
Published
2025-06-16
How to Cite
Kazymyra І., & Tsapiv , V. (2025). Real-time dynamic hand gesture recognition system on edge devices. COMPUTER-INTEGRATED TECHNOLOGIES: EDUCATION, SCIENCE, PRODUCTION, (59), 126-135. https://doi.org/10.36910/6775-2524-0560-2025-59-17
Section
Computer science and computer engineering