Title :
Maximum entropy based Chinese-Japanese word alignment
Author :
Guo, Hongmei ; Wang, Xiaojie
Author_Institution :
Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., China
fDate :
30 Oct.-1 Nov. 2005
Abstract :
Word alignment can be used for numerous applications in natural language processing, such as lexicography, machine translation and so on. In this paper, we present an efficient context-dependent word alignment model based on maximum entropy (ME) approach. All knowledge sources are treated as feature functions in this model, such as source words, POS information and bilingual dictionary. This approach allows easy integration of more context-dependent information. We perform experiments on a Chinese-Japanese parallel corpus and the results are compared with a manually produced reference alignment. We achieve good alignment accuracy in a very noisy environment using unsupervised train method. And the effects of different features are also evaluated.
Keywords :
context-free grammars; language translation; maximum entropy methods; natural languages; Chinese-Japanese word alignment; POS information; bilingual dictionary; context-dependent information; lexicography; machine translation; maximum entropy approach; natural language processing; noisy environment; unsupervised train method; Context modeling; Dictionaries; Entropy; Machine learning; Natural language processing; Natural languages; Parameter estimation; Terminology; Testing; Working environment noise;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598777