DocumentCode :
3131575
Title :
Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier
Author :
Lucey, Simon ; Sridharan, Sridha ; Chandran, Vinod
Author_Institution :
Sch. of Electr. & Electron. Syst. Eng., Queensland Univ. of Technol., Brisbane, Qld., Australia
fYear :
2001
fDate :
2001
Firstpage :
551
Lastpage :
554
Abstract :
The adaptive fusion of video and audio is one of the fundamental pursuits of audio-visual speech recognition (AVSR). This paper investigates the use of a high-dimensional secondary classifier, applied to the word likelihood scores from both the audio and video modalities, for the purposes of adaptive fusion. The results presented lie at or above the boundary of catastrophic fusion across a number of audio noise levels.
Keywords :
sensor fusion; speech recognition; video signal processing; adaptive audio-visual fusion; audio noise levels; audio visual speech recognition; high dimensional secondary classifier; stochastic secondary classifier; word likelihood scores; Australia; Degradation; Dispersion; Laboratories; Noise level; Noise measurement; Speech recognition; Stochastic processes; Systems engineering and theory; Working environment noise
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Intelligent Multimedia, Video and Speech Processing, 2001. Proceedings of 2001 International Symposium on
Conference_Location :
Hong Kong
Print_ISBN :
962-85766-2-3
Type :
conf
DOI :
10.1109/ISIMP.2001.925455
Filename :
925455