DocumentCode
2805161
Title
Vectors and Graphs: Two Representations to Cluster Web Sites Using Hyperstructure
Author
Meneses, Esteban
Author_Institution
Comput. Res. Center, Costa Rica Inst. of Technol., Cartago
fYear
2006
fDate
Oct. 2006
Firstpage
172
Lastpage
178
Abstract
Web site clustering consists in finding meaningful groups of related Web sites. How related is some Web site to another is a question that depends on how we represent Web sites. Traditionally, vectors and graphs have been two important structures to represent individuals in a population. Both representations can play an important role in the Web area if hyper structure is considered. By analyzing the way Web sites are linked, we can build vectors or graphs to understand how a Web site collection is partitioned. In this paper, we analyze these two models and four associated algorithms: k-means and self-organizing maps (SOM) with vectors, simulated annealing and genetic algorithms with graphs. For testing these ideas we clustered some Web sites in the Central American Web. We compare the results for clustering this Web site collection using both models and show what kind of clusters each one produces
Keywords
Web sites; genetic algorithms; graph theory; pattern clustering; self-organising feature maps; simulated annealing; vectors; Web site clustering; genetic algorithms; graphical representation; hyperstructure; k-means algorithm; self-organizing maps; simulated annealing; vector representation; Algorithm design and analysis; Clustering algorithms; Libraries; Partitioning algorithms; Self organizing feature maps; Simulated annealing; Testing; Web pages; Web sites; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Congress, 2006. LA-Web '06. Fourth Latin American
Conference_Location
Cholula
Print_ISBN
0-7695-2693-4
Type
conf
DOI
10.1109/LA-WEB.2006.36
Filename
4022107
Link To Document