Title :
Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier
Author :
Lucey, Simon ; Sridharan, Sridha ; Chandran, Vinod
Author_Institution :
Sch. of Electr. & Electron. Syst. Eng., Queensland Univ. of Technol., Brisbane, Qld., Australia
Abstract :
The adaptive fusion of video and audio is one of the fundamental pursuits of audio-visual speech recognition (AVSR). This paper investigates the use of a high-dimensional secondary classifier, applied to the word likelihood scores from both the audio and video modalities, for the purposes of adaptive fusion. Results are presented that lie at or above the boundary of catastrophic fusion across a number of audio noise levels.
Keywords :
sensor fusion; speech recognition; video signal processing; adaptive audio-visual fusion; audio noise levels; audio visual speech recognition; high dimensional secondary classifier; stochastic secondary classifier; word likelihood scores; Australia; Degradation; Dispersion; Laboratories; Noise level; Noise measurement; Speech recognition; Stochastic processes; Systems engineering and theory; Working environment noise
Conference_Title :
Intelligent Multimedia, Video and Speech Processing, 2001. Proceedings of 2001 International Symposium on
Conference_Location :
Hong Kong
Print_ISBN :
962-85766-2-3
DOI :
10.1109/ISIMP.2001.925455