Title :
Refining Web authoritative resource by frequent structures
Author :
Zhou, Haofeng ; Lou, Yubo ; Yuan, Qingqing ; Ng, Wilfred ; Wang, Wei ; Shi, Baile
Author_Institution :
Dept. of Comput. & Inf. Technol., Fudan Univ., Shanghai, China
Abstract :
The Web resource is a rich collection of the dynamic information, which is useful in various disciplines. There has also been much research work related to improving the quality of information searching in the Web. However, most of the work is still inadequate to satisfy a diversified demand from users. In this paper, we exploit the hyperlinks in the Web and propose a new approach called SFP in order to improve the quality of research results obtain from search engines. The SFP algorithm evolves from the frequent pattern mining technique, which is a common data mining technique for conventional databases. The essential idea of our approach is to mine the frequent structures of links from a given Web topology. By using the SFP algorithm, we extract the authoritative pages and communities from the complex Web topology. We demonstrate our approach by running several experiments and show that the performance and functionalities of using the SFP in managing search results are better than other known methods such as HITS.
Keywords :
Internet; data mining; data structures; information resources; query processing; search engines; topology; HITS; SFP algorithm; WWW; Web authoritative resource refinement; Web hyperlink; Web topology; World Wide Web; authoritative page extraction; data mining; dynamic information; frequent pattern mining; frequent structure mining; global information service; information searching; research result quality improvement; search engine; search result management; web link structure; Computer science; Data mining; Databases; Information technology; Navigation; Search engines; Topology; Web pages; Web sites; World Wide Web;
Conference_Titel :
Database Engineering and Applications Symposium, 2003. Proceedings. Seventh International
Print_ISBN :
0-7695-1981-4
DOI :
10.1109/IDEAS.2003.1214934