DocumentCode :
2226599
Title :
Matrix dimensionality reduction for mining Web logs
Author :
Lu, Jianjiang ; Xu, Baowen ; Yang, Hongji
Author_Institution :
Dept. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
fYear :
2003
fDate :
13-17 Oct. 2003
Firstpage :
405
Lastpage :
408
Abstract :
Web-based logs contain potentially useful data with which designers can assess the usability and effectiveness of their choices. Clustering techniques have been used to automatically discover typical user profiles from Web access logs recently. But it is a challenging problem to design effective similarity measure between the session vectors, which are usually high dimensional and sparse. Nonnegative matrix factorisation approaches are applied to dimensionality reduction of the session-URL matrix, and the spherical k-means algorithm is used to partition the projecting vectors of the user session vectors into several clusters. Two methods for discovering typical user session profiles from the clusters are presented last. The results of experiment show that our algorithms can mine interesting user profiles effectively.
Keywords :
Web sites; customer profiles; data mining; matrix decomposition; pattern clustering; vectors; Clustering techniques; Web log mining; Web sites; matrix dimensionality reduction; nonnegative matrix factorization; session-URL matrix; spherical k-means algorithm; user session profiles; user session vectors partitioning; Clustering algorithms; Computer science; Data mining; Educational technology; Laboratories; Partitioning algorithms; Programmable logic arrays; Sparse matrices; Uniform resource locators; Web server;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1932-6
Type :
conf
DOI :
10.1109/WI.2003.1241222
Filename :
1241222
Link To Document :
بازگشت