DocumentCode :
3327519
Title :
Geometrical Feature Extraction for Robust Speech Recognition
Author :
Li, Xiaokun ; Kwan, Chiman
Author_Institution :
Signal/Image Process. & Control Group, Intelligent Autom., Inc., Rockville, MD
fYear :
2005
fDate :
Oct. 28 2005-Nov. 1 2005
Firstpage :
558
Lastpage :
562
Abstract :
Visual information from lip contour has been successfully shown to improve the robustness of automatic speech recognition especially in noisy environments. In this paper, a novel method for lip reading is presented. In the method, hue information of input images is used for lip area detection. Then, a set of morphological operations is applied to detect lip contour. Polynomial fitting is designed for geometrical feature extraction. With the extracted features, hidden Markov models and Gaussian mixture models are trained to recognize speech. The experimental results demonstrated that the proposed method improved speech recognition rates in noisy environment. Another advantage of the method is its robustness to lighting variances
Keywords :
Gaussian processes; feature extraction; hidden Markov models; polynomials; speech recognition; Gaussian mixture models; geometrical feature extraction; hidden Markov models; lip area detection; lip contour; polynomial fitting; robust speech recognition; Automatic speech recognition; Discrete cosine transforms; Feature extraction; Hidden Markov models; Morphological operations; Mouth; Robustness; Shape; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2005. Conference Record of the Thirty-Ninth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
1-4244-0131-3
Type :
conf
DOI :
10.1109/ACSSC.2005.1599811
Filename :
1599811
Link To Document :
بازگشت