Title :
Spatio-Temporal Feature-Extraction Techniques for Isolated Gesture Recognition in Arabic Sign Language
Author :
Shanableh, Tamer ; Assaleh, Khaled ; Al-Rousan, M.
Author_Institution :
Dept. of Comput. Sci., American Univ. of Sharjah
fDate :
6/1/2007 12:00:00 AM
Abstract :
This paper presents various spatio-temporal feature-extraction techniques with applications to online and offline recognitions of isolated Arabic Sign Language gestures. The temporal features of a video-based gesture are extracted through forward, backward, and bidirectional predictions. The prediction errors are thresholded and accumulated into one image that represents the motion of the sequence. The motion representation is then followed by spatial-domain feature extractions. As such, the temporal dependencies are eliminated and the whole video sequence is represented by a few coefficients. The linear separability of the extracted features is assessed, and its suitability for both parametric and nonparametric classification techniques is elaborated upon. The proposed feature-extraction scheme was complemented by simple classification techniques, namely, K nearest neighbor (KNN) and Bayesian, i.e., likelihood ratio, classifiers. Experimental results showed classification performance ranging from 97% to 100% recognition rates. To validate our proposed technique, we have conducted a series of experiments using the classical way of classifying data with temporal dependencies, namely, hidden Markov models (HMMs). Experimental results revealed that the proposed feature-extraction scheme combined with simple KNN or Bayesian classification yields comparable results to the classical HMM-based scheme. Moreover, since the proposed scheme compresses the motion information of an image sequence into a single image, it allows for using simple classification techniques where the temporal dimension is eliminated. This is actually advantageous for both computational and storage requirements of the classifier
Keywords :
Bayes methods; data compression; feature extraction; gesture recognition; hidden Markov models; image classification; image motion analysis; image representation; image sequences; natural languages; spatiotemporal phenomena; video signal processing; Arabic Sign Language; Bayesian classification; K nearest neighbor; feature-extraction technique; gesture recognition; hidden Markov model; image motion analysis; image representation; spatio-temporal phenomena; video-based gesture; Auditory system; Bayesian methods; Deafness; Feature extraction; Handicapped aids; Hidden Markov models; Image processing; Pattern recognition; Signal processing algorithms; Speech; Feature extraction; motion analysis; pattern classification; visual languages; Algorithms; Arab World; Artificial Intelligence; Gestures; Humans; Image Interpretation, Computer-Assisted; Pattern Recognition, Automated; Sign Language;
Journal_Title :
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
DOI :
10.1109/TSMCB.2006.889630