Title :
Web user profiling using hierarchical clustering with improved similarity measure
Author :
Algiriyage, Nilani ; Jayasena, Sanath ; Dias, Gihan
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Moratuwa, Moratuwa, Sri Lanka
Abstract :
Web user profiling targets grouping users in to clusters with similar interests. Web sites are attracted by many visitors and gaining insight to the patterns of access leaves lot of information. Web server access log files record every single request processed by web site visitors. Applying web usage mining techniques allow to identify interesting patterns. In this paper we have improved the similarity measure proposed by Velásquez et al. [1] and used it as the distance measure in an agglomerative hierarchical clustering for a data set from an online banking web site. To generate profiles, frequent item set mining is applied over the clusters. Our results show that proper visitor clustering can be achieved with the improved similarity measure.
Keywords :
Internet; Web sites; data mining; pattern clustering; Web server access; Web usage mining techniques; Web user profiling; hierarchical clustering; online banking Web site; Data mining; Mathematical model; Navigation; Time measurement; Web pages; Web servers;
Conference_Titel :
Moratuwa Engineering Research Conference (MERCon), 2015
Conference_Location :
Moratuwa
DOI :
10.1109/MERCon.2015.7112362