Title :
Visual speech recognition using Convolutional VEF snake and canonical correlations
Author :
Lu, Kun ; Wu, Yuwei ; Jia, Yunde
Author_Institution :
Sch. of Software, Beijing Inst. of Technol., Beijing, China
Abstract :
This paper presents a novel approach for automatic visual speech recognition using Convolutional VEF snake and canonical correlations. The utterance image sequences of isolated Chinese words are recorded with a head-mounted camera, and we use Convolutional VEF snake model to detect and track lip boundary rapidly and accurately. Geometric and motion features are both extracted from lip contour sequences and concatenated to form a joint feature descriptor. Canonical correlation is applied to measure the similarity of two utterance feature matrices and a linear discriminant function is introduced to make further improvement on the recognition accuracy. Experimental results demonstrate that our approach is promising and the joint feature descriptor is more robust than individual ones.
Keywords :
cameras; convolutional codes; feature extraction; image sequences; speech recognition; automatic visual speech recognition; canonical correlations; convolutional VEF snake; geometric features; head-mounted camera; image sequences; isolated Chinese words; joint feature descriptor; linear discriminant function; lip boundary; lip contour sequences; motion features; utterance feature matrices; Correlation; Feature extraction; Hidden Markov models; Image sequences; Speech; Speech recognition; Visualization; Visual speech recognition; canonical correlation; head-camera; joint feature descriptor; snake model;
Conference_Titel :
Information Computing and Telecommunications (YC-ICT), 2010 IEEE Youth Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-8883-4
DOI :
10.1109/YCICT.2010.5713068