DocumentCode :
2258591
Title :
Clustered language models with context-equivalent states
Author :
Ueberla, J.P. ; Gransden, I.R.
Author_Institution :
Forum Technol., DRA Malvern, Malvern, UK
Volume :
4
fYear :
1996
fDate :
3-6 Oct 1996
Firstpage :
2060
Abstract :
A hierarchical context definition is added to an existing clustering algorithm in order to increase its robustness. The resulting algorithm, which clusters contexts and events separately, is used to experiment with different ways of defining the context a language model takes into account. The contexts range from standard bigram and trigram contexts to part of speech five-grams. Although none of the models can compete directly with a backoff trigram, they give up to 9% improvement in perplexity when interpolated with a trigram. Moreover, the modified version of the algorithm leads to a performance increase over the original version of up to 12%
Keywords :
natural languages; pattern recognition; speech processing; backoff trigram; clustered language models; clustering algorithm; context equivalent states; hierarchical context definition; part of speech five-grams; performance increase; perplexity; standard bigram; trigram contexts; Clustering algorithms; Context modeling; Convergence; Probability; Robustness; Speech; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607206
Filename :
607206
Link To Document :
بازگشت