• DocumentCode
    468230
  • Title

    A New Method for Cross-Language Information Retrieval by Summing Weights of Graphs

  • Author

    Yuan, Song An ; Yu, Song Nian

  • Author_Institution
    Shanghai Univ., Shanghai
  • Volume
    2
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    326
  • Lastpage
    330
  • Abstract
    Disambiguation is the aim of most translation techniques used in cross-language information retrieval. In this paper, we present a new method for query translation which only needs a bilingual dictionary and a monolingual corpus. Unlike the traditional statistical approach, our method uses co-occurrences between pairs of terms as statistical measure. By adding up all the weights of a k-complete subgraph, we can compare different combinations of target terms. The output of our method is in the form of probability distribution. Then the result is converted to the query in the target language. The method is easy to implement, and experiment shows it performs well.
  • Keywords
    dictionaries; graph theory; information retrieval; natural language processing; statistical distributions; text analysis; bilingual dictionary; cooccurrences; cross-language information retrieval; disambiguation; k-complete subgraph; monolingual corpus; probability distribution; query translation; statistical measure; Dictionaries; Distributed computing; Frequency estimation; Frequency shift keying; Fuzzy systems; Information retrieval; Natural languages; Probability distribution; Search engines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.84
  • Filename
    4406096