DocumentCode
2849757
Title
Multi-view clustering
Author
Bickel, Steffen ; Scheffer, Tobias
Author_Institution
Dept. of Comput. Sci., Humboldt-Univ. zu Berlin, Germany
fYear
2004
fDate
1-4 Nov. 2004
Firstpage
19
Lastpage
26
Abstract
We consider clustering problems in which the available attributes can be split into two independent subsets, such that either subset suffices for learning. Example applications of this multi-view setting include clustering of Web pages which have an intrinsic view (the pages themselves) and an extrinsic view (e.g., anchor texts of inbound hyperlinks); multi-view learning has so far been studied in the context of classification. We develop and study partitioning and agglomerative, hierarchical multi-view clustering algorithms for text data. We find empirically that the multi-view versions of k-means and EM greatly improve on their single-view counterparts. By contrast, we obtain negative results for agglomerative hierarchical multi-view clustering. Our analysis explains this surprising phenomenon.
Keywords
data mining; learning (artificial intelligence); pattern classification; pattern clustering; set theory; text analysis; Web pages; agglomerative hierarchical multiview clustering; clustering algorithm; independent subsets; multiview learning; partitioning; text data; Data mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN
0-7695-2142-8
Type
conf
DOI
10.1109/ICDM.2004.10095
Filename
1410262
Link To Document