DocumentCode
1860664
Title
Adaptive bimodal sensor fusion for automatic speechreading
Author
Meier, Uwe ; Hürst, Wolfgang ; Duchnowski, Paul
Author_Institution
Interactive Syst. Labs., Karlsruhe Univ., Germany
Volume
2
fYear
1996
fDate
7-10 May 1996
Firstpage
833
Abstract
We present work on improving the performance of automated speech recognizers by using additional visual information: (lip-/speechreading); achieving error reduction of up to 50%. This paper focuses on different methods of combining the visual and acoustic data to improve the recognition performance. We show this on an extension of an existing state-of-the-art speech recognition system, a modular MS-TDNN. We have developed adaptive combination methods at several levels of the recognition network. Additional information such as estimated signal-to-noise ratio (SNR) is used in some cases. The results of the different combination methods are shown for clean speech and data with artificial noise (white, music, motor). The new combination methods adapt automatically to varying noise conditions making hand-tuned parameters unnecessary
Keywords
acoustic signal processing; adaptive signal processing; image processing; multilayer perceptrons; sensor fusion; speech recognition; SNR; acoustic data; adaptive bimodal sensor fusion; adaptive combination methods; artificial noise; automated speech recognizer performance; automatic speechreading; clean speech; error reduction; lip reading; modular MS-TDNN; motor; music; noise conditions; recognition network; signal-to-noise ratio; speech recognition system; visual data; visual information; white noise; Acoustic noise; Acoustic testing; Background noise; Interactive systems; Loudspeakers; Sensor fusion; Signal to noise ratio; Speech enhancement; Speech recognition; White noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.543250
Filename
543250
Link To Document