DocumentCode
2456594
Title
Comparing Different Approaches to Treat Translation Ambiguity in CLIR: Structured Queries vs. Target Co-occurrence Based Selection
Author
Saralegi, Xabier ; Lopez de Lacalle, M.
Author_Institution
R&D Elhuyar Found., Usurbil, Spain
fYear
2009
fDate
Aug. 31 2009-Sept. 4 2009
Firstpage
398
Lastpage
404
Abstract
Two main problems in Cross-language Information Retrieval are translation selection and the treatment of out-of-vocabulary terms. In this paper, we will be focusing on the problem concerning the translation selection. Structured queries and target co-occurrence-based methods seem to be the most appropriate approaches when parallel corpora are not available. However, there is no comparative study. In this paper we compare the results obtained using each of the aforementioned methods, we specify the weaknesses of each method, and finally we propose a hybrid method to combine both. In terms of mean average precision, results for Basque-English cross-lingual retrieval show that structured queries are the best approach both with long queries and short queries.
Keywords
language translation; natural language processing; query processing; Basque-English cross-lingual retrieval; cross-language information retrieval; mean average precision; out-of-vocabulary terms; structured queries; target co-occurrence based selection; translation selection; Algorithm design and analysis; Context; Databases; Dictionaries; Expert systems; Information retrieval; Natural languages; Research and development; Statistics; Crosslingual information retrieval; co-occurrence statistics; structured query translation;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
Conference_Location
Linz
ISSN
1529-4188
Print_ISBN
978-0-7695-3763-4
Type
conf
DOI
10.1109/DEXA.2009.58
Filename
5337087
Link To Document