Title :
Web Usage Data Clustering Using Dbscan Algorithm and Set Similarities
Author :
Santhisree, K. ; Damodaram ; Appaji, S. ; NagarjunaDevi, D.
Author_Institution :
CSE, JNTU, Hyderabad, India
Abstract :
Web usage mining is the application of data mining techniques to web log data repositories. It is used in finding the user access patterns from web access log. User page visits are sequential in nature. In this paper we presented new Rough set Dbscan clustering algorithm which identifies the behavior of the users page visits, order of occurrence of visits. Web data Clusters are formed using the rough set Similarity Upper Approximations. We present the experimental results on MSNBC web navigation dataset, and proved that Rough set Dbscan clustering has better efficiency and performance clustering in web usage mining is finding the groups which share common interests compared to Rough set agglomerative clustering.
Keywords :
Internet; approximation theory; data mining; pattern clustering; rough set theory; MSNBC Web navigation dataset; Web log data repository; Web usage data clustering; Web usage mining; data mining technique; rough set Dbscan clustering algorithm; rough set similarity upper approximations; set similarities; Clustering algorithms; Data analysis; Data engineering; Data mining; Databases; Educational institutions; Memory; Navigation; Partitioning algorithms; Rough sets; Sequence; Web usage data; rough sets; set approximations; set similarity;
Conference_Titel :
Data Storage and Data Engineering (DSDE), 2010 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4244-5678-9
DOI :
10.1109/DSDE.2010.14