DocumentCode :
729729
Title :
Sign Language Recognition using 3D convolutional neural networks
Author :
Jie Huang ; Wengang Zhou ; Houqiang Li ; Weiping Li
Author_Institution :
Univ. of Sci. & Technol. of China, Hefei, China
fYear :
2015
fDate :
June 29 2015-July 3 2015
Firstpage :
1
Lastpage :
6
Abstract :
Sign Language Recognition (SLR) targets on interpreting the sign language into text or speech, so as to facilitate the communication between deaf-mute people and ordinary people. This task has broad social impact, but is still very challenging due to the complexity and large variations in hand actions. Existing methods for SLR use hand-crafted features to describe sign language motion and build classification models based on those features. However, it is difficult to design reliable features to adapt to the large variations of hand gestures. To approach this problem, we propose a novel 3D convolutional neural network (CNN) which extracts discriminative spatial-temporal features from raw video stream automatically without any prior knowledge, avoiding designing features. To boost the performance, multi-channels of video streams, including color information, depth clue, and body joint positions, are used as input to the 3D CNN in order to integrate color, depth and trajectory information. We validate the proposed model on a real dataset collected with Microsoft Kinect and demonstrate its effectiveness over the traditional approaches based on hand-crafted features.
Keywords :
handicapped aids; image colour analysis; neural nets; sign language recognition; social sciences; video signal processing; 3D convolutional neural networks; CNN; Microsoft Kinect; SLR targets; body joint positions; color information; deaf-mute people; depth clue; hand gestures; ordinary people; sign language recognition; social impact; video streams; Assistive technology; Convolution; Feature extraction; Gesture recognition; Hidden Markov models; Three-dimensional displays; Trajectory; 3D Convolutional Neural Networks; Deep Learning; Sign Language Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location :
Turin
Type :
conf
DOI :
10.1109/ICME.2015.7177428
Filename :
7177428
Link To Document :
بازگشت