• DocumentCode
    2805161
  • Title

    Vectors and Graphs: Two Representations to Cluster Web Sites Using Hyperstructure

  • Author

    Meneses, Esteban

  • Author_Institution
    Comput. Res. Center, Costa Rica Inst. of Technol., Cartago
  • fYear
    2006
  • fDate
    Oct. 2006
  • Firstpage
    172
  • Lastpage
    178
  • Abstract
    Web site clustering consists in finding meaningful groups of related Web sites. How related is some Web site to another is a question that depends on how we represent Web sites. Traditionally, vectors and graphs have been two important structures to represent individuals in a population. Both representations can play an important role in the Web area if hyper structure is considered. By analyzing the way Web sites are linked, we can build vectors or graphs to understand how a Web site collection is partitioned. In this paper, we analyze these two models and four associated algorithms: k-means and self-organizing maps (SOM) with vectors, simulated annealing and genetic algorithms with graphs. For testing these ideas we clustered some Web sites in the Central American Web. We compare the results for clustering this Web site collection using both models and show what kind of clusters each one produces
  • Keywords
    Web sites; genetic algorithms; graph theory; pattern clustering; self-organising feature maps; simulated annealing; vectors; Web site clustering; genetic algorithms; graphical representation; hyperstructure; k-means algorithm; self-organizing maps; simulated annealing; vector representation; Algorithm design and analysis; Clustering algorithms; Libraries; Partitioning algorithms; Self organizing feature maps; Simulated annealing; Testing; Web pages; Web sites; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Congress, 2006. LA-Web '06. Fourth Latin American
  • Conference_Location
    Cholula
  • Print_ISBN
    0-7695-2693-4
  • Type

    conf

  • DOI
    10.1109/LA-WEB.2006.36
  • Filename
    4022107