DocumentCode
2078076
Title
Application of maximum entropy method in Chinese-English cross language information retrieval
Author
Chen, Qin ; Liu, Lei ; Ma, Lin
Author_Institution
Sch. of Comput., Electron. & Inf., Guangxi Univ., Nanning, China
fYear
2008
fDate
22-25 Nov. 2008
Firstpage
1192
Lastpage
1195
Abstract
In cross language information retrieval, such as Chinese-English information retrieval, the query sentence often comprises some query keywords, but not a complete sentence. Because of the lack of necessary context and syntactic information in the query keywords, it is difficult to translate the query sentence accurately and return the results which are the best fit to the query. So, in Chinese-English cross language information retrieval, how to obtain effective Web pages and evaluate translation candidates are two challenging issues. In this paper, an approach based maximum entropy method (MEM) is proposed to obtain effective Web pages. For obtaining a correct translation list, we establish English-Chinese, Chinese - English special dictionary. Thus we can translate the query as accurately as possible by using bi-directional translation with disambiguation based on MEM. Experimental results demonstrate that the proposed method has a good performance in Chinese-English cross language information retrieval, and achieves 86.8% accuracy.
Keywords
Web sites; language translation; maximum entropy methods; natural language processing; query processing; Chinese-English cross language information retrieval; Web pages; maximum entropy method; query keywords; Bidirectional control; Dictionaries; Entropy; Indexing; Information retrieval; Internet; Natural languages; Software tools; Vocabulary; Web pages; Bi-directional Translation; Maximum Entropy Method; Special Dictionary;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer-Aided Industrial Design and Conceptual Design, 2008. CAID/CD 2008. 9th International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4244-3290-5
Electronic_ISBN
978-1-4244-3291-2
Type
conf
DOI
10.1109/CAIDCD.2008.4730776
Filename
4730776
Link To Document