Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks
Citations Over TimeTop 1% of 2016 papers
Abstract
Automatic detection and classification of dynamic hand gestures in real-world systems intended for human computer interaction is challenging as: 1) there is a large diversity in how people perform gestures, making detection and classification difficult, 2) the system must work online in order to avoid noticeable lag between performing a gesture and its classification, in fact, a negative lag (classification before the gesture is finished) is desirable, as feedback to the user can then be truly instantaneous. In this paper, we address these challenges with a recurrent three-dimensional convolutional neural network that performs simultaneous detection and classification of dynamic hand gestures from multi-modal data. We employ connectionist temporal classification to train the network to predict class labels from inprogress gestures in unsegmented input streams. In order to validate our method, we introduce a new challenging multimodal dynamic hand gesture dataset captured with depth, color and stereo-IR sensors. On this challenging dataset, our gesture recognition system achieves an accuracy of 83:8%, outperforms competing state-of-the-art algorithms, and approaches human accuracy of 88:4%. Moreover, our method achieves state-of-the-art performance on SKIG and ChaLearn2014 benchmarks.
Related Papers
- → Literature review on dynamic hand gesture recognition(2022)9 cited
- → An Intelligent Smart Home Control Using Body Gestures(2006)18 cited
- → An Intelligent Smart Home Control Using Body Gestures(2006)16 cited
- → Real‐time hand gestures system based on leap motion(2018)11 cited
- Real-Time Gesture Recognition for Dynamic Applications(2015)