DocumentCode :
542313
Title :
Maximum entropy and MCE based HMM stream weight estimation for audio-visual ASR
Author :
Gravier, Guillaume ; Axelrod, Scott ; Potamianos, Gerasimos ; Neti, Chalapathy
Author_Institution :
IBM T. J. Watson Research Center, PO Box 218, Yorktown Heights, NY 10598, USA
Volume :
1
fYear :
2002
fDate :
13-17 May 2002
Abstract :
In this paper, we propose a new fast and flexible algorithm based on the maximum entropy (MAXENT) criterion to estimate stream weights in a state-synchronous multi-stream HMM. The technique is compared to the minimum classification error (MCE) criterion and to a brute-force, grid-search optimization of the WER on both a small and a large vocabulary audio-visual continuous speech recognition task. When estimating global stream weights, the MAXENT approach gives comparable results to the grid-search and the MCE. Estimation of state dependent weights is also considered: We observe significant improvements in both the MAXENT and MCE criteria, which, however, do not result in significant WER gains.
Keywords :
Biological system modeling; Clustering algorithms; Estimation; Gold; Hidden Markov models; Signal to noise ratio; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5743873
Filename :
5743873
Link To Document :
بازگشت