DocumentCode
239282
Title
Multi-view clustering of web documents using multi-objective genetic algorithm
Author
Wahid, Abdul ; Xiaoying Gao ; Andreae, Peter
Author_Institution
Victoria Univ. of Wellington, Wellington, New Zealand
fYear
2014
fDate
6-11 July 2014
Firstpage
2625
Lastpage
2632
Abstract
Clustering ensembles are a common approach to clustering problem, which combine a collection of clustering into a superior solution. The key issues are how to generate different candidate solutions and how to combine them. Common approach for generating candidate clustering solutions ignores the multiple representations of the data (i.e., multiple views) and the standard approach of simply selecting the best solution from candidate clustering solutions ignores the fact that there may be a set of clusters from different candidate clustering solutions which can form a better clustering solution. This paper presents a new clustering method that exploits multiple views to generate different clustering solutions and then selects a combination of clusters to form a final clustering solution. Our method is based on Nondominated Sorting Genetic Algorithm (NSGA-II), which is a multi-objective optimization approach. Our new method is compared with five existing algorithms on three data sets that have increasing difficulty. The results show that our method significantly outperforms other methods.
Keywords
Internet; document handling; genetic algorithms; pattern clustering; sorting; NSGA-II; Web documents; candidate clustering solutions; clustering ensembles; multiobjective genetic algorithm; multiobjective optimization approach; multiple data representations; multiview clustering problem; nondominated sorting genetic algorithm; Clustering algorithms; Evolutionary computation; Linear programming; Optimization; Sociology; Standards; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2014 IEEE Congress on
Conference_Location
Beijing
Print_ISBN
978-1-4799-6626-4
Type
conf
DOI
10.1109/CEC.2014.6900586
Filename
6900586
Link To Document