Title :
Wordspotter training using figure-of-merit back propagation
Author :
Lippmann, Richard P. ; Chang, Eric I. ; Jankowski, Charles R.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
A new approach to wordspotter training is presented which directly maximizes the figure of merit (FOM) defined as the average detection rate over a specified range of false alarm rates. This systematic approach to discriminant training for wordspotters eliminates the necessity of ad hoc thresholds and tuning. It improves the FOM of wordspotters tested using cross-validation on the credit-card speech corpus training conversations by 4 to 5 percentage points to roughly 70%. This improved performance requires little extra complexity during wordspotting and only two extra passes through the training data during training. The FOM gradient is computed analytically for each putative hit, back-propagated through HMM word models using the Viterbi alignment, and used to adjust RBF hidden node centers and state-weights associated with every node in HMM keyword models
Keywords :
backpropagation; hidden Markov models; optimisation; speech recognition; HMM word models; RBF hidden node centers; Viterbi alignment; average detection rate; complexity; credit-card speech corpus training conversations; cross-validation; discriminant training; false alarm rates; figure-of-merit back propagation; state-weights; wordspotter training; Covariance matrix; Density functional theory; Filter bank; Government; Hidden Markov models; Laboratories; Speech; Testing; Training data; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389274