DocumentCode :
1933653
Title :
Complexity reduction in a large vocabulary speech recognizer
Author :
Pieraccini, Roberto ; Lee, Chin-Hui ; Giachin, Egidio ; Rabiner, Lawrence R.
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
729
Abstract :
The authors provide a detailed description of all aspects of the implementation of a large-vocabulary, speaker-independent, continuous speech recognizer used as a tool for the development of recognition algorithms based on hidden Markov models (HMMs) and Viterbi decoding. The complexity of HMM recognizers is greatly increased by the introduction of detailed context-dependent units for representing interword coarticulation. A vectorized representation of the data structures involved in the decoding process, along with compilation of the connection information among temporally consecutive words and an efficient implementation of the beam search pruning, has led to a speedup of the algorithm of about one order of magnitude. A guided search can be used during a tuning phase to obtain a further speedup of more than a factor of three. An average recognition time of about 25 s per sentence, although far from real time, allows one to perform a series of training experiments and to tune the recognition system parameters in order to obtain high word accuracy on complex recognition tasks such as the DARPA resource management task.
Keywords :
Markov processes; computational complexity; decoding; speech recognition; 25 s; DARPA resource management task; HMM; Viterbi decoding; beam search pruning; computational complexity; connection information; context-dependent units; data structures; guided search; hidden Markov models; high word accuracy; interword coarticulation; large vocabulary speech recognizer; recognition algorithms; recognition time; speaker independent speech recognition; temporally consecutive words; tuning phase; vectorized representation; Computational efficiency; Context modeling; Decoding; Hidden Markov models; Logic; Speech recognition; Tail; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150443
Filename :
150443