DocumentCode
349619
Title
Discriminative training based on frame level likelihood normalization and its application for speech and speaker recognition
Author
Markov, K.P. ; Hanai, K. ; Nakagawa, S.
Author_Institution
Res. Eng. Dept., ATR-I, Kyoto, Japan
Volume
1
fYear
1999
fDate
1999
Firstpage
532
Abstract
We present a method for discriminative estimation of parameters of Gaussian distribution based classifiers and its application to speech and speaker recognition. The objective of this method is to maximize the normalized likelihood of the design samples. In contrast to other discriminative algorithms such as minimum classification error/generalized probabilistic descent (MCE/GPD) and maximum mutual information (MMI), the objective function is optimized using a modified expectation-maximization (EM) algorithm. The evaluation experiments using both clean and telephone speech showed improvement of the recognition rates compared to the maximum likelihood estimation (MLE) training method, especially when the mismatch between the training and testing conditions is significant. Compared with the MCE/GPD discriminative method, our algorithm showed better performance in both the speech and speaker recognition tasks
Keywords
Gaussian distribution; maximum likelihood estimation; speaker recognition; Gaussian distribution based classifiers; clean speech; discriminative estimation; discriminative training; frame level likelihood normalization; generalized probabilistic descent; maximum likelihood estimation; maximum mutual information; minimum classification error; modified expectation-maximization algorithm; normalized likelihood; objective function; recognition rates; telephone speech; Application software; Maximum likelihood estimation; Mutual information; Parameter estimation; Probability; Speaker recognition; Speech analysis; Speech recognition; Telephony; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location
Tokyo
ISSN
1062-922X
Print_ISBN
0-7803-5731-0
Type
conf
DOI
10.1109/ICSMC.1999.814148
Filename
814148
Link To Document