DocumentCode :
2805161
Title :
Vectors and Graphs: Two Representations to Cluster Web Sites Using Hyperstructure
Author :
Meneses, Esteban
Author_Institution :
Comput. Res. Center, Costa Rica Inst. of Technol., Cartago
fYear :
2006
fDate :
Oct. 2006
Firstpage :
172
Lastpage :
178
Abstract :
Web site clustering consists in finding meaningful groups of related Web sites. How related is some Web site to another is a question that depends on how we represent Web sites. Traditionally, vectors and graphs have been two important structures to represent individuals in a population. Both representations can play an important role in the Web area if hyper structure is considered. By analyzing the way Web sites are linked, we can build vectors or graphs to understand how a Web site collection is partitioned. In this paper, we analyze these two models and four associated algorithms: k-means and self-organizing maps (SOM) with vectors, simulated annealing and genetic algorithms with graphs. For testing these ideas we clustered some Web sites in the Central American Web. We compare the results for clustering this Web site collection using both models and show what kind of clusters each one produces
Keywords :
Web sites; genetic algorithms; graph theory; pattern clustering; self-organising feature maps; simulated annealing; vectors; Web site clustering; genetic algorithms; graphical representation; hyperstructure; k-means algorithm; self-organizing maps; simulated annealing; vector representation; Algorithm design and analysis; Clustering algorithms; Libraries; Partitioning algorithms; Self organizing feature maps; Simulated annealing; Testing; Web pages; Web sites; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Congress, 2006. LA-Web '06. Fourth Latin American
Conference_Location :
Cholula
Print_ISBN :
0-7695-2693-4
Type :
conf
DOI :
10.1109/LA-WEB.2006.36
Filename :
4022107
Link To Document :
بازگشت