DocumentCode :
2789670
Title :
Language model adaptation using Random Forests
Author :
Deoras, Anoop ; Jelinek, Frederick ; Su, Yi
Author_Institution :
Center for Language & Speech Process., Johns Hopkins Univ., Baltimore, MN, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5198
Lastpage :
5201
Abstract :
In this paper we investigate random forest based language model adaptation. Large amounts of out-of-domain data are used to grow the decision trees while very small amounts of in-domain data are used to prune them back, so that the structure of the trees are suitable for the desired domain while the probabilities in the tree nodes are reliably estimated. Extensive experiments are carried out and results are reported on a particular task of adapting Broadcast News language model to the MIT computer science lecture domain. We show 0.80% and 0.60% absolute WER improvement over language model interpolation and count merging techniques, respectively.
Keywords :
decision trees; natural language processing; speech processing; count merging technique; decision trees; language model adaptation; language model interpolation; random forest; Adaptation model; Broadcasting; Computer science; Decision trees; History; Interpolation; Merging; Natural languages; Speech processing; Training data; Adaptation; Language Modeling; Random Forests;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495012
Filename :
5495012
Link To Document :
بازگشت