DocumentCode
290359
Title
An algorithm of high resolution and efficient multiple string hypothesization for continuous speech recognition using inter-word models
Author
Chou, W. ; Matsuoka, T. ; Juang, B.H. ; Lee, C.H.
Author_Institution
Dept. of Res. Technol., AT&T Bell Labs., Murray Hill, NJ, USA
Volume
ii
fYear
1994
fDate
19-22 Apr 1994
Abstract
We propose a new accurate string hypothesization algorithm to find the N-best multiple string hypotheses in continuous speech recognition. The algorithm differs from the conventional N-best search algorithms in that it allows the use of the same set of long term language model scores and the detailed context-dependent subword models such as inter-word context dependent triphone models in both forward and backward search for high performance speech recognition. It is an extension of the tree-trellis N-best search algorithm[1]. The inter-word context dependency is exactly preserved in both forward partial path map preparation and the proposed backward N-best multiple string hypothesis tree search. The search efficiency is maximized by applying the same high resolution acoustic and language models in both search directions. When search heuristics are used, the proposed approach provides a more accurate string model matching than that of the conventional frame-synchronous Viterbi beam search decoder
Keywords
natural languages; search problems; speech recognition; tree searching; backward search; context dependent triphone models; context-dependent subword models; continuous speech recognition; forward search; high resolution acoustic models; high resolution language models; inter-word models; long term language model scores; multiple string hypotheses; search efficiency; search heuristics; string hypothesization algorithm; string model matching; tree-trellis N-best search algorithm; Acoustic beams; Context modeling; Decoding; Humans; Laboratories; Natural languages; Protection; Speech recognition; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389696
Filename
389696
Link To Document