Title :
A Thesaurus Construction Method from Large ScaleWeb Dictionaries
Author :
Nakayama, Keisuke ; Hara, Takahiro ; Nishio, Shojiro
Author_Institution :
Dept. of Multimedia Eng., Osaka Univ., Suita
Abstract :
Web-based dictionaries, such as Wikipedia, have become dramatically popular among the Internet users in past several years. The important characteristic of Web-based dictionary is not only the huge amount of articles, but also hyperlinks. Hyperlinks have various information more than just providing transfer function between pages. In this paper, we propose an efficient method to analyze the link structure of Web-based dictionaries to construct an association thesaurus. We have already applied it to Wikipedia, a huge scale Web-based dictionary which has a dense link structure, as a corpus. We developed a search engine for evaluation, then conducted a number of experiments to compare our method with other traditional methods such as cooccurrence analysis.
Keywords :
Internet; dictionaries; thesauri; Internet; Wikipedia; cooccurrence analysis; dense link structure; large scale Web dictionaries; thesaurus construction method; Dictionaries; Frequency; Humans; Information science; Internet; Large-scale systems; Natural languages; Thesauri; Transfer functions; Wikipedia;
Conference_Titel :
Advanced Information Networking and Applications, 2007. AINA '07. 21st International Conference on
Conference_Location :
Niagara Falls, ON
Print_ISBN :
0-7695-2846-5
DOI :
10.1109/AINA.2007.23