Title :
Discriminative training of dynamic programming based speech recognizers
Author :
Chang, Pao-Chung ; Juang, Biing-hwang
Author_Institution :
Telecommun. Labs., Minist. of Commun., Taiwan
fDate :
4/1/1993 12:00:00 AM
Abstract :
A new minimum recognition error formulation and a generalized probabilistic descent (GPD) algorithm are analyzed and used to accomplish discriminative training of a conventional dynamic-programming-based speech recognizer. The objective of discriminative training here is to directly minimize the recognition error rate. To achieve this, a formulation that allows controlled approximation of the exact error rate and renders optimization possible is used. The GPD method is implemented in a dynamic-time-warping (DTW)-based system. A linear discriminant function on the DTW distortion sequence is used to replace the conventional average DTW path distance. A series of speaker-independent recognition experiments using the highly confusible English E-set as the vocabulary showed a recognition rate of 84.4% compared to ~60% for traditional template training via clustering. The experimental results verified that the algorithm converges to a solution that achieves minimum error rate
Keywords :
dynamic programming; learning (artificial intelligence); speech recognition; DTW distortion sequence; discriminative training; dynamic programming; dynamic-time-warping; generalized probabilistic descent; highly confusible English E-set; linear discriminant function; minimum recognition error formulation; optimization; speaker-independent recognition; speech recognizer; Algorithm design and analysis; Artificial neural networks; Character recognition; Dynamic programming; Error analysis; Error correction; Probability distribution; Speech analysis; Speech recognition; Vocabulary;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on