  • DocumentCode
    290359
  • Title
    An algorithm of high resolution and efficient multiple string hypothesization for continuous speech recognition using inter-word models
  • Author
    Chou, W. ; Matsuoka, T. ; Juang, B.H. ; Lee, C.H.
  • Author_Institution
    Dept. of Res. Technol., AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    ii
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    We propose a new, accurate string hypothesization algorithm to find the N-best multiple string hypotheses in continuous speech recognition. The algorithm differs from conventional N-best search algorithms in that it allows the same set of long-term language model scores and the same detailed context-dependent subword models, such as inter-word context-dependent triphone models, to be used in both the forward and the backward search for high-performance speech recognition. It is an extension of the tree-trellis N-best search algorithm [1]. The inter-word context dependency is preserved exactly in both the forward partial path map preparation and the proposed backward N-best multiple string hypothesis tree search. Search efficiency is maximized by applying the same high-resolution acoustic and language models in both search directions. When search heuristics are used, the proposed approach provides more accurate string model matching than the conventional frame-synchronous Viterbi beam search decoder.
  • Keywords
    natural languages; search problems; speech recognition; tree searching; backward search; context dependent triphone models; context-dependent subword models; continuous speech recognition; forward search; high resolution acoustic models; high resolution language models; inter-word models; long term language model scores; multiple string hypotheses; search efficiency; search heuristics; string hypothesization algorithm; string model matching; tree-trellis N-best search algorithm; Acoustic beams; Context modeling; Decoding; Humans; Laboratories; Natural languages; Protection; Speech recognition; Viterbi algorithm
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1994.389696
  • Filename
    389696
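
The abstract describes a two-pass search: a forward pass prepares a map of best partial path scores, and a backward tree search uses that map to produce the N-best hypotheses. The following is a minimal sketch of that general tree-trellis idea only, not the authors' algorithm: it runs on a small hypothetical word lattice with additive costs (for example, negative log scores) and does not model inter-word triphones or a language model. The names forward_best_costs and n_best, the edge format, and the example lattice are all assumptions made for illustration.

```python
import heapq
from collections import defaultdict


def forward_best_costs(edges, start):
    """Forward pass: lowest cost from `start` to every reachable node.

    `edges` is a list of (src, dst, word, cost) with non-negative costs.
    This plays the role of the forward partial path map (an assumption
    about how such a map could be stored, not the paper's data structure).
    """
    outgoing = defaultdict(list)
    for src, dst, _word, cost in edges:
        outgoing[src].append((dst, cost))
    best = {start: 0.0}
    heap = [(0.0, start)]
    while heap:
        cost, node = heapq.heappop(heap)
        if cost > best.get(node, float("inf")):
            continue  # stale queue entry
        for nxt, step in outgoing[node]:
            new_cost = cost + step
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(heap, (new_cost, nxt))
    return best


def n_best(edges, start, goal, n):
    """Backward best-first tree search guided by the forward scores.

    Each queue entry is a partial path growing backward from `goal`.
    Its priority is the cost accumulated backward plus the exact best
    forward cost to its current node, so complete paths are popped in
    order of increasing total cost.
    """
    forward = forward_best_costs(edges, start)
    results = []
    if goal not in forward:
        return results
    incoming = defaultdict(list)
    for src, dst, word, cost in edges:
        incoming[dst].append((src, word, cost))
    counter = 0  # tie-breaker so the heap never compares nodes or words
    heap = [(forward[goal], 0.0, counter, goal, ())]
    while heap and len(results) < n:
        total, back, _, node, words = heapq.heappop(heap)
        if node == start:
            results.append((total, list(words)))
        for prev, word, cost in incoming[node]:
            if prev not in forward:
                continue  # not reachable from start
            counter += 1
            new_back = back + cost
            heapq.heappush(
                heap,
                (forward[prev] + new_back, new_back, counter, prev, (word,) + words),
            )
    return results


# Hypothetical two-word lattice: node 0 is the start, node 2 is the end.
lattice = [
    (0, 1, "call", 1),
    (0, 1, "tall", 2),
    (1, 2, "home", 2),
    (1, 2, "hole", 4),
]
hypotheses = n_best(lattice, start=0, goal=2, n=3)
for total, words in hypotheses:
    print(total, " ".join(words))
```

Because the forward scores in this sketch are exact rather than estimated, the backward search never has to revise a hypothesis once it reaches the start node; this is the property that lets a tree-trellis style search return hypotheses in rank order.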