• DocumentCode
    2456594
  • Title

    Comparing Different Approaches to Treat Translation Ambiguity in CLIR: Structured Queries vs. Target Co-occurrence Based Selection

  • Author

    Saralegi, Xabier ; Lopez de Lacalle, M.

  • Author_Institution
    R&D Elhuyar Found., Usurbil, Spain
  • fYear
    2009
  • fDate
    Aug. 31 2009-Sept. 4 2009
  • Firstpage
    398
  • Lastpage
    404
  • Abstract
    Two main problems in Cross-language Information Retrieval are translation selection and the treatment of out-of-vocabulary terms. In this paper, we will be focusing on the problem concerning the translation selection. Structured queries and target co-occurrence-based methods seem to be the most appropriate approaches when parallel corpora are not available. However, there is no comparative study. In this paper we compare the results obtained using each of the aforementioned methods, we specify the weaknesses of each method, and finally we propose a hybrid method to combine both. In terms of mean average precision, results for Basque-English cross-lingual retrieval show that structured queries are the best approach both with long queries and short queries.
  • Keywords
    language translation; natural language processing; query processing; Basque-English cross-lingual retrieval; cross-language information retrieval; mean average precision; out-of-vocabulary terms; structured queries; target co-occurrence based selection; translation selection; Algorithm design and analysis; Context; Databases; Dictionaries; Expert systems; Information retrieval; Natural languages; Research and development; Statistics; Crosslingual information retrieval; co-occurrence statistics; structured query translation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Application, 2009. DEXA '09. 20th International Workshop on
  • Conference_Location
    Linz
  • ISSN
    1529-4188
  • Print_ISBN
    978-0-7695-3763-4
  • Type

    conf

  • DOI
    10.1109/DEXA.2009.58
  • Filename
    5337087