A New Method for Cross-Language Information Retrieval by Summing Weights of Graphs

Author

Yuan, Song An ; Yu, Song Nian

Author_Institution

Shanghai Univ., Shanghai

Volume

2

fYear

2007

fDate

24-27 Aug. 2007

Firstpage

326

Lastpage

330

Abstract

Disambiguation is the aim of most translation techniques used in cross-language information retrieval. In this paper, we present a new method for query translation which only needs a bilingual dictionary and a monolingual corpus. Unlike the traditional statistical approach, our method uses co-occurrences between pairs of terms as statistical measure. By adding up all the weights of a k-complete subgraph, we can compare different combinations of target terms. The output of our method is in the form of probability distribution. Then the result is converted to the query in the target language. The method is easy to implement, and experiment shows it performs well.

Keywords

dictionaries; graph theory; information retrieval; natural language processing; statistical distributions; text analysis; bilingual dictionary; cooccurrences; cross-language information retrieval; disambiguation; k-complete subgraph; monolingual corpus; probability distribution; query translation; statistical measure; Dictionaries; Distributed computing; Frequency estimation; Frequency shift keying; Fuzzy systems; Information retrieval; Natural languages; Probability distribution; Search engines;

fLanguage

English

Publisher

ieee

Conference_Titel

Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on

Conference_Location

Haikou

Print_ISBN

978-0-7695-2874-8

Type

conf

DOI

10.1109/FSKD.2007.84

Filename

4406096