DocumentCode
323754
Title
Restructuring Gaussian mixture density functions in speaker-independent acoustic models
Author
Nakamura, Atsushi
Author_Institution
ATR Interpreting Telephony Res. Labs., Kyoto, Japan
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
649
Abstract
In continuous speech recognition featuring hidden Markov model (HMM), word N-gram and time-synchronous beam search, a local modeling mismatch in the HMM will often cause the recognition performance to degrade. To cope with this problem, this paper proposes a method of restructuring Gaussian mixture PDFs in a pre-trained speaker-independent HMM based on speech data. In this method, mixture components are copied and shared among multiple mixture PDFs with the tendency of local errors taken into account. The tendency is given by comparing the pre-trained HMM and speech data which was used in the pre-training. Experimental results prove that the proposed method can effectively restore local modeling mismatches and improve the recognition performance
Keywords
Gaussian processes; acoustic signal processing; grammars; hidden Markov models; probability; search problems; speech recognition; Gaussian mixture PDF restructuring; Gaussian mixture density functions; continuous speech recognition; experimental results; hidden Markov model; local errors; local modeling mismatch; pre-trained speaker-independent HMM; recognition performance; speaker-independent acoustic models; speech data; time-synchronous beam search; word N-gram; Acoustic beams; Deformable models; Degradation; Hidden Markov models; Loudspeakers; Speech recognition; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675348
Filename
675348
Link To Document