Title :
Iterative Mining Translations from the Web
Author :
Li, Fang ; Yuan, Shuangqing ; Sheng, Huanye
Author_Institution :
Dept. of Computer Science and Engineering, Shanghai Jiao Tong University
Abstract :
Multilingual translations play a vital role multilingual or cross-lingual information retrieval and extraction. In this paper, we describe a new method mine translations from bilingual web pages based on our former research. Two new features are introduced, one is the iterative mining process in order to increase the number of translation pairs; the other is the filtering step which deletes language-specific prefix and postfix in hyperlinks. Experiments show that the precision has been greatly improved due to the filtering step and the number of translation pairs increased after six iterations.
Keywords :
Computer science; Data mining; Humans; IP networks; Information filtering; Information filters; Information retrieval; Iterative methods; Uniform resource locators; Web pages;
Conference_Titel :
Web Information Retrieval and Integration, 2005. WIRI '05. Proceedings. International Workshop on Challenges in
Print_ISBN :
0-7695-2414-1
DOI :
10.1109/WIRI.2005.24