DocumentCode :
1860664
Title :
Adaptive bimodal sensor fusion for automatic speechreading
Author :
Meier, Uwe ; Hürst, Wolfgang ; Duchnowski, Paul
Author_Institution :
Interactive Syst. Labs., Karlsruhe Univ., Germany
Volume :
2
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
833
Abstract :
We present work on improving the performance of automated speech recognizers by using additional visual information: (lip-/speechreading); achieving error reduction of up to 50%. This paper focuses on different methods of combining the visual and acoustic data to improve the recognition performance. We show this on an extension of an existing state-of-the-art speech recognition system, a modular MS-TDNN. We have developed adaptive combination methods at several levels of the recognition network. Additional information such as estimated signal-to-noise ratio (SNR) is used in some cases. The results of the different combination methods are shown for clean speech and data with artificial noise (white, music, motor). The new combination methods adapt automatically to varying noise conditions making hand-tuned parameters unnecessary
Keywords :
acoustic signal processing; adaptive signal processing; image processing; multilayer perceptrons; sensor fusion; speech recognition; SNR; acoustic data; adaptive bimodal sensor fusion; adaptive combination methods; artificial noise; automated speech recognizer performance; automatic speechreading; clean speech; error reduction; lip reading; modular MS-TDNN; motor; music; noise conditions; recognition network; signal-to-noise ratio; speech recognition system; visual data; visual information; white noise; Acoustic noise; Acoustic testing; Background noise; Interactive systems; Loudspeakers; Sensor fusion; Signal to noise ratio; Speech enhancement; Speech recognition; White noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.543250
Filename :
543250
Link To Document :
بازگشت