DocumentCode :
2513411
Title :
Visual speech recognition using Convolutional VEF snake and canonical correlations
Author :
Lu, Kun ; Wu, Yuwei ; Jia, Yunde
Author_Institution :
Sch. of Software, Beijing Inst. of Technol., Beijing, China
fYear :
2010
fDate :
28-30 Nov. 2010
Firstpage :
154
Lastpage :
157
Abstract :
This paper presents a novel approach for automatic visual speech recognition using Convolutional VEF snake and canonical correlations. The utterance image sequences of isolated Chinese words are recorded with a head-mounted camera, and we use Convolutional VEF snake model to detect and track lip boundary rapidly and accurately. Geometric and motion features are both extracted from lip contour sequences and concatenated to form a joint feature descriptor. Canonical correlation is applied to measure the similarity of two utterance feature matrices and a linear discriminant function is introduced to make further improvement on the recognition accuracy. Experimental results demonstrate that our approach is promising and the joint feature descriptor is more robust than individual ones.
Keywords :
cameras; convolutional codes; feature extraction; image sequences; speech recognition; automatic visual speech recognition; canonical correlations; convolutional VEF snake; geometric features; head-mounted camera; image sequences; isolated Chinese words; joint feature descriptor; linear discriminant function; lip boundary; lip contour sequences; motion features; utterance feature matrices; Correlation; Feature extraction; Hidden Markov models; Image sequences; Speech; Speech recognition; Visualization; Visual speech recognition; canonical correlation; head-camera; joint feature descriptor; snake model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Computing and Telecommunications (YC-ICT), 2010 IEEE Youth Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8883-4
Type :
conf
DOI :
10.1109/YCICT.2010.5713068
Filename :
5713068
Link To Document :
بازگشت