DocumentCode :
2281943
Title :
Employing topic modeling for statistical machine translation
Author :
Zhengxian, Gong ; Guodong, Zhou
Author_Institution :
Sch. of Comput. Sci. & Technol., Soochow Univ., Suzhou, China
Volume :
4
fYear :
2011
fDate :
10-12 June 2011
Firstpage :
24
Lastpage :
28
Abstract :
The mixture modeling approaches have dominated the research of domain adaptation in Statistical Machine Translation (SMT). Such approaches construct a general model and several sub-models in advance and focus on the way of determining the relative importance of all the models. In this paper, we propose a simple yet effective approach for better domain adaptation in phrase-based SMT via topic modeling. Different from existing approaches, our topic modeling approach employs one additional feature function to capture the topic inherent in the source phrase and help the decoder dynamically choose related target phrases according to the specific topic of the source phrase. Evaluation on a conversation corpus shows very encouraging results.
Keywords :
language translation; statistical analysis; conversation corpus; mixture modeling approaches; phrase-based SMT; source phrase; statistical machine translation; topic modeling approach; Adaptation models; Analytical models; Business; Computational modeling; Decoding; Hidden Markov models; Training; domain adaptation; statistical macine translation; topic modeling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Automation Engineering (CSAE), 2011 IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-8727-1
Type :
conf
DOI :
10.1109/CSAE.2011.5952796
Filename :
5952796
Link To Document :
بازگشت