Title :
Interactive query learning for isolated speech recognition
Author :
Hwang, Jenq-Neng ; Li, Hang
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
fDate :
31 Aug-2 Sep 1992
Abstract :
The authors propose an interactive query learning approach to isolated speech recognition tasks. The approach starts with training multiple `one-net-one-class´ time delay neural networks (TDNNs) based on sequences of LPC vectors. After all TDNNs are trained, initiated from each available LPC training sequence for one specific TDNN (say, class k), an improved network inversion algorithm with imposing constraint is used to generate a set of inverted LPC sequences corresponding to various output values of the corresponding TDNN. By carefully listening to synthesized speech based on the inverted LPC sequences, a conjugate pair of LPC sequences is selected from the whole set of LPC sequences; one corresponds to the acceptable speech of class k and the other corresponds to the unacceptable speech of class k. This conjugate LPC sequence pair constitutes some parts of the classification boundary associated with this class, and should be further used as the training date to refine the already trained classifier boundary. A 6% accuracy improvement was achieved when the proposed method was tested on speaker independent E-set data
Keywords :
learning (artificial intelligence); neural nets; speech recognition; LPC vectors sequences; conjugate LPC sequence pair; interactive query learning; isolated speech recognition; network inversion algorithm; speaker independent E-set data; synthesized speech; time delay neural networks; training; Automatic testing; Hidden Markov models; Information processing; Laboratories; Linear predictive coding; Network synthesis; Neural networks; Speech recognition; Speech synthesis; Training data;
Conference_Titel :
Neural Networks for Signal Processing [1992] II., Proceedings of the 1992 IEEE-SP Workshop
Conference_Location :
Helsingoer
Print_ISBN :
0-7803-0557-4
DOI :
10.1109/NNSP.1992.253704