An algorithm of high resolution and efficient multiple string hypothesization for continuous speech recognition using inter-word models

Author

Chou, W. ; Matsuoka, T. ; Juang, B.H. ; Lee, C.H.

Author_Institution

Dept. of Res. Technol., AT&T Bell Labs., Murray Hill, NJ, USA

Volume

ii

fYear

1994

fDate

19-22 Apr 1994

Abstract

We propose a new accurate string hypothesization algorithm to find the N-best multiple string hypotheses in continuous speech recognition. The algorithm differs from the conventional N-best search algorithms in that it allows the use of the same set of long term language model scores and the detailed context-dependent subword models such as inter-word context dependent triphone models in both forward and backward search for high performance speech recognition. It is an extension of the tree-trellis N-best search algorithm[1]. The inter-word context dependency is exactly preserved in both forward partial path map preparation and the proposed backward N-best multiple string hypothesis tree search. The search efficiency is maximized by applying the same high resolution acoustic and language models in both search directions. When search heuristics are used, the proposed approach provides a more accurate string model matching than that of the conventional frame-synchronous Viterbi beam search decoder

Keywords

natural languages; search problems; speech recognition; tree searching; backward search; context dependent triphone models; context-dependent subword models; continuous speech recognition; forward search; high resolution acoustic models; high resolution language models; inter-word models; long term language model scores; multiple string hypotheses; search efficiency; search heuristics; string hypothesization algorithm; string model matching; tree-trellis N-best search algorithm; Acoustic beams; Context modeling; Decoding; Humans; Laboratories; Natural languages; Protection; Speech recognition; Viterbi algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location

Adelaide, SA

ISSN

1520-6149

Print_ISBN

0-7803-1775-0

Type

conf

DOI

10.1109/ICASSP.1994.389696

Filename

389696