DocumentCode :
2186493
Title :
HITS is principal components analysis
Author :
Saerens, Marco ; Fouss, Francois
Author_Institution :
Inf. Syst. Res. Unit, Univ. Catholique de Louvain, Louvain-la-Neuve, Belgium
fYear :
2005
fDate :
19-22 Sept. 2005
Firstpage :
782
Lastpage :
785
Abstract :
In this work, we show that Kleinberg´s hubs and authorities model (HITS) is simply principal components analysis (PCA; maybe the most widely used multivariate statistical analysis method), albeit without centering, applied to the adjacency matrix of the graph of Web pages. We further show that a variant of HITS, SALSA, is closely related to correspondence analysis, another standard multivariate statistical analysis method. In addition, to provide a clear statistical interpretation for HITS, this result suggests to rely on existing work already published in the multivariate statistical analysis literature (extensions of PCA or correspondence analysis) in order to analyse or design new Web pages scoring procedures.
Keywords :
Web sites; graph theory; matrix algebra; principal component analysis; HITS; SALSA; Web page; adjacency matrix; correspondence analysis; hubs and authorities model; multivariate statistical analysis; principal components analysis; Algorithm design and analysis; Computer science; Data mining; Information systems; Principal component analysis; Statistical analysis; Statistics; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2415-X
Type :
conf
DOI :
10.1109/WI.2005.71
Filename :
1517954
Link To Document :
بازگشت