DocumentCode :
1867361
Title :
Feature space video stream consistency estimation for dynamic stream weighting in audio-visual speech recognition
Author :
Terry, Louis H. ; Shiell, Derek J. ; Katsaggelos, Aggelos K.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Northwestern Univ., Evanston, IL
fYear :
2008
fDate :
12-15 Oct. 2008
Firstpage :
1316
Lastpage :
1319
Abstract :
Most current audio-visual automatic speech recognition (AV-ASR) systems use static weights to balance audio and visual information during information fusion. State-of-the-art research has led to the use of audio reliability metrics to change the fusion weights dynamically, which improves overall recognition results. So far, however, incorporating visual reliability metrics into these audio-reliability-metric-based systems has not significantly improved performance. We introduce a new approach to this problem: inferring the "consistency" between the audio and visual information and leveraging the existing audio reliability metrics to create a video reliability metric. Our approach is formulated in the extracted feature space and thus does not rely on analyzing the video signal itself. The framework presented in this work is competitive with systems based on audio-only reliability metrics and shows promise of consistently outperforming them.
Keywords :
feature extraction; speech recognition; video streaming; actual video signal analysis; audio information; audio reliability metrics; audio-visual automatic speech recognition; dynamic stream weighting; feature space extraction; feature space video stream consistency estimation; information fusion; video reliability metric; visual information; Automatic speech recognition; Data mining; Feature extraction; Hidden Markov models; Humans; Loudspeakers; Space technology; Speech recognition; Streaming media; Vector quantization; Hidden Markov Models; Speech Recognition; Vector Quantization;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Image Processing, 2008. ICIP 2008. 15th IEEE International Conference on
Conference_Location :
San Diego, CA
ISSN :
1522-4880
Print_ISBN :
978-1-4244-1765-0
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2008.4712005
Filename :
4712005