Regional Information Center for Science and Technology - Complexity reduction in a large vocabulary speech recognizer

DocumentCode :

1933653

Title :

Complexity reduction in a large vocabulary speech recognizer

Author :

Pieraccini, Roberto ; Lee, Chin-Hui ; Giachin, Egidio ; Rabiner, Lawrence R.

Author_Institution :

AT&T Bell Labs., Murray Hill, NJ, USA

Year :

1991

Date :

14-17 Apr 1991

Firstpage :

729

Abstract :

The authors describe in detail the implementation of a large-vocabulary, speaker-independent, continuous speech recognizer used as a tool for developing recognition algorithms based on hidden Markov models (HMMs) and Viterbi decoding. The complexity of HMM recognizers is greatly increased by the introduction of detailed context-dependent units for representing interword coarticulation. A vectorized representation of the data structures involved in the decoding process, together with compilation of the connection information among temporally consecutive words and an efficient implementation of beam search pruning, has sped up the algorithm by about one order of magnitude. A guided search can be used during a tuning phase to obtain a speedup of more than a factor of three. An average recognition time of about 25 s per sentence, although far from real time, allows one to run a series of training experiments and to tune the recognition system parameters in order to obtain high word accuracy on complex recognition tasks such as the DARPA resource management task.
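
The abstract names beam search pruning within Viterbi decoding but gives no implementation details. The following is a minimal, generic sketch of that technique on a toy three-state HMM, not the authors' recognizer: after each frame, any state whose log score falls more than `beam` below the best score is dropped. The function `viterbi_beam`, the helper `lg`, and all model values are hypothetical and chosen only for illustration.

```python
import math

NEG_INF = float("-inf")

def lg(p):
    """Log of a probability, mapping 0 to -inf (hypothetical helper)."""
    return math.log(p) if p > 0 else NEG_INF

def viterbi_beam(log_init, log_trans, log_emit, beam):
    """Time-synchronous Viterbi decoding with beam pruning.

    log_init[s], log_trans[s][t], log_emit[frame][s] are log probabilities.
    beam is the pruning width in log units (float('inf') disables pruning).
    Returns (best_log_score, best_state_path, active_states_per_frame).
    """
    n_states = len(log_init)
    # Frame 0: only states with a non-zero initial probability are active.
    scores = {s: log_init[s] + log_emit[0][s]
              for s in range(n_states) if log_init[s] > NEG_INF}
    backptrs, active_counts = [], []
    for frame in range(len(log_emit)):
        if frame > 0:
            new_scores, bp = {}, {}
            # Expand only the states that survived pruning.
            for s, sc in scores.items():
                for t in range(n_states):
                    if log_trans[s][t] == NEG_INF:
                        continue
                    cand = sc + log_trans[s][t] + log_emit[frame][t]
                    if cand > new_scores.get(t, NEG_INF):
                        new_scores[t], bp[t] = cand, s
            scores = new_scores
            backptrs.append(bp)
        # Beam pruning: keep states within `beam` of the frame's best score.
        best = max(scores.values())
        scores = {s: sc for s, sc in scores.items() if sc >= best - beam}
        active_counts.append(len(scores))
    # Backtrace from the best surviving final state.
    state = max(scores, key=scores.get)
    best_score, path = scores[state], [state]
    for bp in reversed(backptrs):
        state = bp[state]
        path.append(state)
    path.reverse()
    return best_score, path, active_counts

# Toy left-to-right HMM (assumed values, for illustration only).
log_init = [lg(1.0), lg(0.0), lg(0.0)]
log_trans = [[lg(0.5), lg(0.5), lg(0.0)],
             [lg(0.0), lg(0.5), lg(0.5)],
             [lg(0.0), lg(0.0), lg(1.0)]]
log_emit = [[lg(p) for p in row] for row in
            [[0.9, 0.05, 0.05], [0.8, 0.1, 0.1],
             [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]]

wide = viterbi_beam(log_init, log_trans, log_emit, float("inf"))
narrow = viterbi_beam(log_init, log_trans, log_emit, math.log(10))
```

On this toy model both calls recover the same best path, while the narrow beam expands fewer states in the later frames. In general a beam that is too narrow can prune the true best path, so the width is a tuning parameter that trades speed against accuracy.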

Keywords :

Markov processes; computational complexity; decoding; speech recognition; 25 s; DARPA resource management task; HMM; Viterbi decoding; beam search pruning; computational complexity; connection information; context-dependent units; data structures; guided search; hidden Markov models; high word accuracy; interword coarticulation; large vocabulary speech recognizer; recognition algorithms; recognition time; speaker independent speech recognition; temporally consecutive words; tuning phase; vectorized representation; Computational efficiency; Context modeling; Decoding; Hidden Markov models; Logic; Speech recognition; Tail; Vocabulary;

Language :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location :

Toronto, Ont.

ISSN :

1520-6149

Print_ISBN :

0-7803-0003-3

Type :

conf

DOI :

10.1109/ICASSP.1991.150443

Filename :

150443

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1933653