DocumentCode
1835413
Title
Consideration of Lombard effect for speechreading
Author
Huang, Fu Jie ; Chen, Tsuhan
Author_Institution
Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2001
fDate
2001
Firstpage
613
Lastpage
618
Abstract
We propose a method for integrating audio and visual information to enhance speech recognition in adverse environments. We train the audio hidden Markov model and the visual hidden Markov model separately, and then use a Viterbi algorithm to decode both channels in parallel. The decoding process is asynchronous between the two channels to capture the asynchronous nature of audio and visual speech. We test the proposed method using speech corrupted by various types of noise and speech with the Lombard effect
Keywords
Viterbi decoding; acoustic noise; audio coding; hidden Markov models; speech recognition; video coding; Lombard effect; adverse environments; asynchronous Viterbi decoding; enhanced speech recognition; hidden Markov model; integrated audio-visual information; lip-reading; speechreading; Automatic speech recognition; Background noise; Decoding; Degradation; Hidden Markov models; Speech processing; Speech recognition; State-space methods; Testing; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
Conference_Location
Cannes
Print_ISBN
0-7803-7025-2
Type
conf
DOI
10.1109/MMSP.2001.962800
Filename
962800
Link To Document