DocumentCode :
2580479
Title :
Metadata based Web mining for relevance
Author :
Yi, Jeonghee ; Sundaresan, Neel
Author_Institution :
Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
fYear :
2000
fDate :
2000
Firstpage :
113
Lastpage :
121
Abstract :
This paper presents a relevant term discoverer, a system that discovers topics relevant to a given topic from the World Wide Web. The system mines hyperlink metadata on the basis of the association of terms in the metadata. It also applies various filtering techniques to detect false positives and false negatives. The applications of the system include: i) topic-specific information gathering systems that need to crawl resources of the relevant topic, ii) bibliography search systems that need to extend their search to articles on relevant topics, iii) classification systems that categorize items of a similar class together, and so on. We report a successful application of the system to build a topic-specific search engine dedicated to eXtensible Markup Language (XML). Using the algorithms presented in this paper, we were able to identify the relevant topics that the search engine needs to cover. Together with effective topic-directed crawling algorithms, we were able to build a topic-specific search engine that requires significantly less human labor but performs almost as well as topic-specific search engines whose content is maintained by humans.
Keywords :
data mining; information resources; meta data; relevance feedback; search engines; Web mining; World Wide Web; filtering techniques; hyperlink metadata; metadata; relevance; relevant term discoverer; search engine; topic-specific search-engine; Computer science; Data mining; Graphics; Humans; Search engines; Web mining; Web pages; Web sites; World Wide Web; XML;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Database Engineering and Applications Symposium, 2000 International
Conference_Location :
Yokohama
Print_ISBN :
0-7695-0789-1
Type :
conf
DOI :
10.1109/IDEAS.2000.880569
Filename :
880569