DocumentCode :
3189880
Title :
Utility-Based Web Path Traversal Pattern Mining
Author :
Zhou, Lin ; Liu, Ying ; Wang, Jing ; Shi, Yong
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
373
Lastpage :
380
Abstract :
Web usage mining is to discover user traversal patterns of Web pages from Weblog records. Usually, a popular Website may register the Weblog records in the order of hundreds of megabytes every day, which provide rich information about the Web dynamics. Path traversal pattern mining discovers frequent sequential Web accessing patterns from Weblog databases. However, it fails to reflect the different impacts of different Web pages to different users. The difference between Web pages makes a strong impact on the decision-makings in Internet information service applications. Therefore, in this paper, we introduce "utility" into path traversal pattern mining problem. Utility is a measure of how "interesting" or "useful" a Web page is. As a result, it allows Web service providers to quantify the user preferences of different traversal paths. Two-Phase utility mining method is used to discover high utility path traversal patterns. We apply our proposed "high utility path traversal mining" algorithm on a real-world Weblog database, and compare the high utility path traversal patterns with the frequent traversal patterns by a traditional path traversal method. We demonstrated the interesting paths, as well as their significance to the decision making process.
Keywords :
Conferences; Data mining; Databases; Decision making; Information analysis; Uniform resource locators; Web and internet services; Web pages; Web server; Web services;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
Type :
conf
DOI :
10.1109/ICDMW.2007.72
Filename :
4476694
Link To Document :
بازگشت