DocumentCode
1895148
Title
Continuous speech recognition using linked predictive neural networks
Author
Tebelskis, Joe ; Waibel, Alex ; Petek, Bojan ; Schmidbauer, Otto
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
1991
fDate
14-17 Apr 1991
Firstpage
61
Abstract
The authors present a large vocabulary, continuous speech recognition system based on linked predictive neural networks (LPNNs). The system uses neural networks as predictors of speech frames, yielding distortion measures which can be used by the one-stage DTW algorithm to perform continuous speech recognition. The system currently achieves 95%, 58%, and 39% word accuracy on tasks with perplexity 7, 111, and 402, respectively, outperforming several simple HMMs that have been tested. It was also found that the accuracy and speed of the LPNN can be slightly improved by the judicious use of hidden control inputs. The strengths and weaknesses of the predictive approach are discussed
Keywords
filtering and prediction theory; neural nets; speech recognition; continuous speech recognition system; distortion measures; hidden control inputs; large vocabulary; linked predictive neural networks; speech frames; task perplexity; word accuracy; Computer networks; Computer science; Couplings; Distortion measurement; Hidden Markov models; Neural networks; Nonlinear distortion; Predictive models; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150278
Filename
150278
Link To Document