DocumentCode :
2720083
Title :
A Thesaurus Construction Method from Large ScaleWeb Dictionaries
Author :
Nakayama, Keisuke ; Hara, Takahiro ; Nishio, Shojiro
Author_Institution :
Dept. of Multimedia Eng., Osaka Univ., Suita
fYear :
2007
fDate :
21-23 May 2007
Firstpage :
932
Lastpage :
939
Abstract :
Web-based dictionaries, such as Wikipedia, have become dramatically popular among the Internet users in past several years. The important characteristic of Web-based dictionary is not only the huge amount of articles, but also hyperlinks. Hyperlinks have various information more than just providing transfer function between pages. In this paper, we propose an efficient method to analyze the link structure of Web-based dictionaries to construct an association thesaurus. We have already applied it to Wikipedia, a huge scale Web-based dictionary which has a dense link structure, as a corpus. We developed a search engine for evaluation, then conducted a number of experiments to compare our method with other traditional methods such as cooccurrence analysis.
Keywords :
Internet; dictionaries; thesauri; Internet; Wikipedia; cooccurrence analysis; dense link structure; large scale Web dictionaries; thesaurus construction method; Dictionaries; Frequency; Humans; Information science; Internet; Large-scale systems; Natural languages; Thesauri; Transfer functions; Wikipedia;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Information Networking and Applications, 2007. AINA '07. 21st International Conference on
Conference_Location :
Niagara Falls, ON
ISSN :
1550-445X
Print_ISBN :
0-7695-2846-5
Type :
conf
DOI :
10.1109/AINA.2007.23
Filename :
4220991
Link To Document :
بازگشت