• DocumentCode
    239282
  • Title

    Multi-view clustering of web documents using multi-objective genetic algorithm

  • Author

    Wahid, Abdul ; Xiaoying Gao ; Andreae, Peter

  • Author_Institution
    Victoria Univ. of Wellington, Wellington, New Zealand
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    2625
  • Lastpage
    2632
  • Abstract
    Clustering ensembles are a common approach to clustering problem, which combine a collection of clustering into a superior solution. The key issues are how to generate different candidate solutions and how to combine them. Common approach for generating candidate clustering solutions ignores the multiple representations of the data (i.e., multiple views) and the standard approach of simply selecting the best solution from candidate clustering solutions ignores the fact that there may be a set of clusters from different candidate clustering solutions which can form a better clustering solution. This paper presents a new clustering method that exploits multiple views to generate different clustering solutions and then selects a combination of clusters to form a final clustering solution. Our method is based on Nondominated Sorting Genetic Algorithm (NSGA-II), which is a multi-objective optimization approach. Our new method is compared with five existing algorithms on three data sets that have increasing difficulty. The results show that our method significantly outperforms other methods.
  • Keywords
    Internet; document handling; genetic algorithms; pattern clustering; sorting; NSGA-II; Web documents; candidate clustering solutions; clustering ensembles; multiobjective genetic algorithm; multiobjective optimization approach; multiple data representations; multiview clustering problem; nondominated sorting genetic algorithm; Clustering algorithms; Evolutionary computation; Linear programming; Optimization; Sociology; Standards; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2014 IEEE Congress on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6626-4
  • Type

    conf

  • DOI
    10.1109/CEC.2014.6900586
  • Filename
    6900586