DocumentCode :
1652678
Title :
Lip feature extraction and reduction for HMM-based visual speech recognition systems
Author :
Alizadeh, S. ; Boostani, R. ; Asadpour, V.
Author_Institution :
Dept. of Comput. Sci. & Eng., Shiraz Univ., Shiraz
fYear :
2008
Firstpage :
561
Lastpage :
564
Abstract :
Lipreading is a main part of audio-visual speech recognition systems which are mostly faced with redundancy of extracted features. In this paper, a new approach has been proposed to increase the lipreading performance by extraction of discriminant features. In this way, first, faces are detected; then, lip key points are extracted in which four cubic curves characterize lip contours. Next, the visual features are extracted from the contours for each frame. To discriminate each speech unit (word) from others, features of that speech unit frames are arranged in a feature vector. Moreover, differences of each frame features from k previous frame features are used to construct more informative feature vectors. To solve the small sample size problem, direct linear discriminant analysis (D-LDA) is employed to reduce the feature size. To classify these transformed features, hidden Markov model (HMM) is used to recognize the speech units. The proposed algorithm was applied on M2VTS database. Results show that applying of D-LDA for feature reduction provides the better classification accuracy compare to employ HMM without feature reduction.
Keywords :
audio-visual systems; feature extraction; hidden Markov models; speech recognition; HMM-based visual speech recognition systems; audio-visual speech recognition system; direct linear discriminant analysis; hidden Markov model; informative feature vectors; lip feature extraction; speech unit frame; Automatic speech recognition; Data mining; Face detection; Feature extraction; Hidden Markov models; Lips; Skin; Speech recognition; Vectors; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-2178-7
Electronic_ISBN :
978-1-4244-2179-4
Type :
conf
DOI :
10.1109/ICOSP.2008.4697195
Filename :
4697195
Link To Document :
بازگشت