• DocumentCode
    337471
  • Title

    Using smoothed K-TSS language models in continuous speech recognition

  • Author

    Varona, A. ; Torres, I.

  • Author_Institution
    Dept. de Electr. y Electron., Pais Vasco Univ., Bilbao, Spain
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    729
  • Abstract
    A syntactic approach of the well-known N-grams models, the K-testable language in the strict sense (K-TSS), is used in this work to be integrated in a continuous speech recognition (CSR) system. The use of smoothed K-TSS regular grammars allowed to obtain a deterministic stochastic finite state automaton (SFSA) integrating K k-TSS models into a self-contained model. An efficient representation of the whole model in a simple array of adequate size is proposed. This structure can be easily handled at decoding time by a simple search function through the array. This formulation strongly reduced the number of parameters to be managed and thus the computing complexity of the model. An experimental evaluation of the proposed SFSA representation was carried out over an Spanish recognition task. These experiments showed important memory saving to allocate K-TSS language models, more important for higher values of K. They also showed that the decoding time did not meaningfully increased when K did. The lower word error rates for the Spanish task tested were achieved for K=4 and 5. As a consequence the ability of this syntactic approach of the N-grams to be well integrated in a CSR system, even for high values of K, has been established
  • Keywords
    computational complexity; finite automata; grammars; natural languages; speech recognition; stochastic processes; K-testable language; N-grams models; Spanish recognition; array; computing complexity; continuous speech recognition; decoding time; deterministic stochastic finite state automaton; experiments; regular grammars; search function; self-contained model; smoothed K-TSS language models; strict sense; syntactic approach; word error rates; Automata; Error analysis; Maximum likelihood decoding; Natural languages; Probability distribution; Smoothing methods; Speech recognition; Statistical analysis; Stochastic processes; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759770
  • Filename
    759770