Title :
Query Disambiguation for Cross-Language Information Retrieval Using Web Directories
Author :
Kimura, Fuminori ; Maeda, Akira ; Miyazaki, Jun ; Uemura, Shunsuke
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol.
Abstract :
Since the Web consists of documents in various domains or genres, methods used for cross-language information retrieval (CLIR) of Web documents should be independent of a particular domain. In this paper, we propose a CLIR method that uses Web directories that are available in multiple language versions (such as Yahoo). In the proposed method, feature terms are first extracted from Web documents for each category in the source and target languages. Then, one or more corresponding categories in the other language are determined beforehand by comparing similarities between categories across languages. Using these category pairs, we can resolve ambiguities in simple dictionary translations. In this paper, we propose a query disambiguation method for CLIR using Web directories. To assess the effectiveness of our method, we tested the proposed retrieval methods experimentally using English and Japanese versions of Yahoo. The results showed that the proposed method is more effective for CLIR than the previous method
Keywords :
Internet; language translation; query formulation; search engines; Web directories; Web documents; Yahoo English versions; Yahoo Japanese versions; Yahoo search engine; cross-language information retrieval; dictionary translations; multiple language versions; query disambiguation; source language; target language; Data mining; Dictionaries; Educational institutions; Information retrieval; Information science; Internet; Natural languages; Search engines; Testing; Web search;
Conference_Titel :
Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
Conference_Location :
Tokyo
Print_ISBN :
0-7695-2414-1
DOI :
10.1109/WIRI.2005.32