  • DocumentCode
    323782
  • Title
    Improved search strategy for large vocabulary continuous Mandarin speech recognition
  • Author
    Ho, Tai-Hsuan ; Yang, Kae-Cherng ; Huang, Kuo-Hsun ; Lee, Lin-shan
  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    825
  • Abstract
    This paper presents a new search strategy for large vocabulary continuous Mandarin speech recognition that takes into account the special structure of the Chinese language. The strategy is composed of forward and backward passes, between which a high-quality syllable lattice is generated to bridge the syllable-level and word-level decoding processes. In the forward pass, considering the small number of syllables in the Chinese language, a frame-synchronous stack decoder is used to integrate the high-order syllable N-Gram language model, so as to generate a very accurate and compact syllable lattice. In the backward pass, considering the special monosyllabic wording structure of the Chinese language, the search space for word-level decoding is expanded dynamically from the syllable lattice, and the best word sequence is extracted based on the knowledge provided by the word pronunciation lexicon and the word N-Gram language model. In preliminary experiments on a speaker-adaptive continuous speech recognition task, this strategy reduced the character error rate by more than 20% compared with a previous system using a syllable-aligned lattice approach.
  • Keywords
    decoding; search problems; speech recognition; Chinese language; Mandarin speech recognition; backward pass; character error rate; forward pass; frame-synchronous stack decoder; high-order syllable N-Gram language model; high-quality syllable lattice; large vocabulary continuous speech recognition; monosyllabic wording structure; search strategy; syllable-level decoding process; word pronunciation lexicon; word-level decoding process; Acoustic beams; Bridges; Computer science; Decoding; Error analysis; Information science; Lattices; Natural languages; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type
    conf
  • DOI
    10.1109/ICASSP.1998.675392
  • Filename
    675392