DocumentCode :
2665047
Title :
A hierarchical approach for better estimation of unseen event likelihood in speech recognition
Author :
Zitouni, Imed ; Zhou, Qiru ; Li, Qi Peter
Author_Institution :
Lucent Technol. Bell Labs., Murray Hill, NJ, USA
fYear :
2003
fDate :
26-29 Oct. 2003
Firstpage :
357
Lastpage :
361
Abstract :
The backoff hierarchical class n-gram language models (LMs) are a generalization of the common backoff word n-gram LMs. Compared to the traditional backoff word n-gram LMs that uses (n-1)-gram to estimate the likelihood of an unseen n-gram event, backoff hierarchical class n-gram LMs uses a class hierarchy to define an appropriate context. We study the impact of the hierarchy depth on the performance of the approach. Performance is evaluated on several databases such us switchboard, call-home and wall street journal (WSJ). Results show that better improvement is achieved when a shallow word (few levels) tree is used. Experiments show up to 26% improvement on the unseen events perplexity and up to 12% improvement in the word error rate (WER).
Keywords :
audio databases; maximum likelihood estimation; speech recognition; WSJ database; backoff hierarchical class n-gram language model; call-home database; class hierarchy; speech recognition; switchboard database; unseen n-gram event; wall street journal database; word error rate; Context modeling; Databases; Error analysis; Frequency estimation; Speech recognition; Training data; USA Councils; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
Type :
conf
DOI :
10.1109/NLPKE.2003.1275931
Filename :
1275931
Link To Document :
بازگشت