DocumentCode :
420202
Title :
Translating unknown cross-lingual queries in digital libraries using a Web-based approach
Author :
Wang, Jenq-Haur ; Teng, Jei-Wen ; Cheng, Pu-Jen ; Lu, Wen-Hsiang ; Chien, Lee-Feng
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taiwan
fYear :
2004
fDate :
7-11 June 2004
Firstpage :
108
Lastpage :
116
Abstract :
Users' cross-lingual queries to a digital library system might be short and not included in a common translation dictionary (unknown terms). In this paper, we investigate the feasibility of exploiting the Web as the corpus source to translate unknown query terms for cross-language information retrieval (CLIR) in digital libraries. We propose a Web-based term translation approach to determine effective translations for unknown query terms by mining bilingual search-result pages obtained from a real Web search engine. This approach can enhance the construction of a domain-specific bilingual lexicon and benefit CLIR services in a digital library that only has monolingual document collections. Very promising results have been obtained in generating effective translation equivalents for many unknown terms, including proper nouns, technical terms and Web query terms.
Keywords :
Internet; digital libraries; query processing; search engines; Web search engine; Web-based approach; bilingual search-result page mining; cross-language information retrieval; digital library; domain-specific bilingual lexicon; monolingual document collection; translation dictionary; user cross-lingual query; Computer science; Data mining; Dictionaries; Information management; Information retrieval; Information science; Natural languages; Search engines; Software libraries; Web search;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on
Print_ISBN :
1-58113-832-6
Type :
conf
DOI :
10.1109/JCDL.2004.1336107
Filename :
1336107