DocumentCode :
290050
Title :
Wordspotter training using figure-of-merit back propagation
Author :
Lippmann, Richard P. ; Chang, Eric I. ; Jankowski, Charles R.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
A new approach to wordspotter training is presented which directly maximizes the figure of merit (FOM) defined as the average detection rate over a specified range of false alarm rates. This systematic approach to discriminant training for wordspotters eliminates the necessity of ad hoc thresholds and tuning. It improves the FOM of wordspotters tested using cross-validation on the credit-card speech corpus training conversations by 4 to 5 percentage points to roughly 70%. This improved performance requires little extra complexity during wordspotting and only two extra passes through the training data during training. The FOM gradient is computed analytically for each putative hit, back-propagated through HMM word models using the Viterbi alignment, and used to adjust RBF hidden node centers and state-weights associated with every node in HMM keyword models
Keywords :
backpropagation; hidden Markov models; optimisation; speech recognition; HMM word models; RBF hidden node centers; Viterbi alignment; average detection rate; complexity; credit-card speech corpus training conversations; cross-validation; discriminant training; false alarm rates; figure-of-merit back propagation; state-weights; wordspotter training; Covariance matrix; Density functional theory; Filter bank; Government; Hidden Markov models; Laboratories; Speech; Testing; Training data; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389274
Filename :
389274
Link To Document :
بازگشت