DocumentCode :
498897
Title :
English-Chinese OOV translation based on PAT Tree
Author :
Wang, Yang ; Zhang, Yue-jie ; Zhang, Tao
Author_Institution :
Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
Volume :
3
fYear :
2009
fDate :
12-15 July 2009
Firstpage :
1732
Lastpage :
1736
Abstract :
In Cross-Language Information Retrieval (CLIR) process, Out-Of-Vocabulary (OOV) or the unknown word translation is a significant and challenging issue. Specifically, for English-Chinese OOV translation, OOV term detection and extraction of translation pair still remain to be key problems. In this paper, an English-Chinese OOV translation pattern based on PAT-Tree is proposed. Web-mining is utilized as the corpus source to collect translation pairs, and translation candidates are acquired by Chinese OOV term extraction based on PAT-Tree. The experimental results show that the proposed approach can outperform some of the current translation engines, and is especially efficient in English-Chinese OOV translation.
Keywords :
Internet; data mining; information retrieval; language translation; search engines; English-Chinese out-of-vocabulary translation; OOV term detection; OOV term extraction; Web-mining; cross-language information retrieval; translation engines; Cybernetics; Machine learning; Cross-Language Information Retrieval (CLIR); English-Chinese OOV translation; Out-of-Vocabulary (OOV); PAT-Tree; term extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2009 International Conference on
Conference_Location :
Baoding
Print_ISBN :
978-1-4244-3702-3
Electronic_ISBN :
978-1-4244-3703-0
Type :
conf
DOI :
10.1109/ICMLC.2009.5212280
Filename :
5212280
Link To Document :
بازگشت