مرکز منطقه ای اطلاع رساني علوم و فناوري - Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier

DocumentCode :

3131575

Title :

Improved speech recognition using adaptive audio-visual fusion via a stochastic secondary classifier

Author :

Lucey, Simon ; Sridharan, Sridha ; Chandran, Vinod

Author_Institution :

Sch. of Electr. & Electron. Syst. Eng., Queensland Univ. of Technol., Brisbane, Qld., Australia

fYear :

2001

fDate :

2001

Firstpage :

551

Lastpage :

554

Abstract :

The adaptive fusion of video and audio is one of the fundamental pursuits of audio visual speech recognition (AVSR). In this paper the use of a high dimensional secondary classifier on the word likelihood scores from both the audio and video modalities is investigated for the purposes of adaptive fusion. Results are presented that lie above or equal to the boundary of catastrophic fusion across a number of audio noise levels

Keywords :

sensor fusion; speech recognition; video signal processing; adaptive audio-visual fusion; audio noise levels; audio visual speech recognition; high dimensional secondary classifier; stochastic secondary classifier; word likelihood scores; Australia; Degradation; Dispersion; Laboratories; Noise level; Noise measurement; Speech recognition; Stochastic processes; Systems engineering and theory; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Multimedia, Video and Speech Processing, 2001. Proceedings of 2001 International Symposium on

Conference_Location :

Hong Kong

Print_ISBN :

962-85766-2-3

Type :

conf

DOI :

10.1109/ISIMP.2001.925455

Filename :

925455

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3131575