DocumentCode :
337468
Title :
Speech recognition experiments using multi-span statistical language models
Author :
Bellegarda, Jerome R.
Author_Institution :
Spoken Language Group, Apple Comput. Inc., Cupertino, CA, USA
Volume :
2
fYear :
1999
fDate :
15-19 Mar 1999
Firstpage :
717
Abstract :
A multi-span framework was proposed to integrate the various constraints, both local and global, that are present in the language. In this approach, local constraints are captured via n-gram language modeling, while global constraints are taken into account through the use of latent semantic analysis. The performance of the resulting multi-span language models, as measured by the perplexity, has been shown to compare favorably with the corresponding n-gram performance. This paper reports on actual speech recognition experiments, and shows that word error rate is also substantially reduced. On a subset of the Wall Street Journal speaker-independent, 20,000-word vocabulary, continuous speech task, the multi-span framework resulted in a reduction in average word error rate of up to 17%
Keywords :
grammars; speech recognition; statistical analysis; Wall Street Journal speaker-independent vocabulary; average word error rate reduction; continuous speech task; global constraints; latent semantic analysis; local constraints; multi-span statistical language models; n-gram language modeling; n-gram performance; performance; perplexity; speech recognition experiments; word error rate; Data mining; Displays; Error analysis; Frequency; History; Natural languages; Robustness; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
ISSN :
1520-6149
Print_ISBN :
0-7803-5041-3
Type :
conf
DOI :
10.1109/ICASSP.1999.759767
Filename :
759767
Link To Document :
بازگشت