Using smoothed K-TSS language models in continuous speech recognition

Author

Varona, A. ; Torres, I.

Author_Institution

Dept. de Electr. y Electron., Pais Vasco Univ., Bilbao, Spain

Volume

2

fYear

1999

fDate

15-19 Mar 1999

Firstpage

729

Abstract

A syntactic approach of the well-known N-grams models, the K-testable language in the strict sense (K-TSS), is used in this work to be integrated in a continuous speech recognition (CSR) system. The use of smoothed K-TSS regular grammars allowed to obtain a deterministic stochastic finite state automaton (SFSA) integrating K k-TSS models into a self-contained model. An efficient representation of the whole model in a simple array of adequate size is proposed. This structure can be easily handled at decoding time by a simple search function through the array. This formulation strongly reduced the number of parameters to be managed and thus the computing complexity of the model. An experimental evaluation of the proposed SFSA representation was carried out over an Spanish recognition task. These experiments showed important memory saving to allocate K-TSS language models, more important for higher values of K. They also showed that the decoding time did not meaningfully increased when K did. The lower word error rates for the Spanish task tested were achieved for K=4 and 5. As a consequence the ability of this syntactic approach of the N-grams to be well integrated in a CSR system, even for high values of K, has been established

Keywords

computational complexity; finite automata; grammars; natural languages; speech recognition; stochastic processes; K-testable language; N-grams models; Spanish recognition; array; computing complexity; continuous speech recognition; decoding time; deterministic stochastic finite state automaton; experiments; regular grammars; search function; self-contained model; smoothed K-TSS language models; strict sense; syntactic approach; word error rates; Automata; Error analysis; Maximum likelihood decoding; Natural languages; Probability distribution; Smoothing methods; Speech recognition; Statistical analysis; Stochastic processes; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location

Phoenix, AZ

ISSN

1520-6149

Print_ISBN

0-7803-5041-3

Type

conf

DOI

10.1109/ICASSP.1999.759770

Filename

759770