DocumentCode :
3413726
Title :
Multimodal information fusion using the iterative decoding algorithm and its application to audio-visual speech recognition
Author :
Shivappa, Shankar T. ; Rao, Bhaskar D. ; Trivedi, Mohan M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, La Jolla, CA
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
2241
Lastpage :
2244
Abstract :
The fusion of information from heterogenous sensors is crucial to the effectiveness of a multimodal system. Noise affect the sensors of different modalities independently. A good fusion scheme should be able to use local estimates of the reliability of each modality to weight the decisions. This paper presents an iterative decoding based information fusion scheme motivated by the theory of turbo codes. This fusion framework is developed in the context of hidden Markov models. We present the mathematical framework of the fusion scheme. We then apply this algorithm to an audio-visual speech recognition task on the GRID audio-visual speech corpus and present the results.
Keywords :
hidden Markov models; iterative decoding; sensor fusion; speech coding; speech recognition; turbo codes; GRID audio-visual speech corpus; audio-visual speech recognition; heterogenous sensors; hidden Markov models; iterative decoding algorithm; multimodal information fusion; turbo codes; Acoustic noise; Application software; Drives; Feature extraction; Hidden Markov models; Iterative algorithms; Iterative decoding; Multimodal sensors; Sensor fusion; Speech recognition; Hidden Markov models; Iterative decoding; Multimedia systems; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518091
Filename :
4518091
Link To Document :
بازگشت