DocumentCode
389736
Title
Finding terminology translations from hyperlinks on the Internet
Author
Yuan, Shuang-qing ; Li, Fang ; Sheng, Huan-Ye
Author_Institution
Dept. of Comput. Sci., Shanghai Jiao Tong Univ., China
Volume
1
fYear
2002
fDate
2002
Firstpage
533
Abstract
In this paper, we describe a novel method to find terminology translations from hyperlinks between bilingual homepages on the Internet. The recognition of terminology and its translation is according to the similarities of their hyperlinks. A hyperlink can be regarded as a vector. The similarity of two vectors is calculated based on the Dice coefficient. Experimental results show that the method is reasonable and useful, and can be applied to any language pairs and domains for multilingual information retrieval and extraction.
Keywords
Internet; Web sites; data mining; language translation; natural languages; nomenclature; Dice coefficient; Internet; Web mining; bilingual homepages; bilingual terminology extraction; hyperlinks; language pairs; multilingual information retrieval; terminology recognition; terminology translations; unparallel corpus extraction; vector similarity; Computer science; Data mining; Electronic mail; Information retrieval; Internet; Libraries; Natural languages; Terminology; Web mining; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN
0-7803-7508-4
Type
conf
DOI
10.1109/ICMLC.2002.1176813
Filename
1176813
Link To Document