Title :
Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization
Author :
Li, Tao ; Ding, Chris ; Jordan, Michael I.
Author_Institution :
Florida Int. Univ., Miami
Abstract :
Consensus clustering and semi-supervised clustering are important extensions of the standard clustering paradigm. Consensus clustering (also known as aggregation of clustering) can improve clustering robustness, deal with distributed and heterogeneous data sources and make use of multiple clustering criteria. Semi-supervised clustering can integrate various forms of background knowledge into clustering. In this paper, we show how consensus and semi-supervised clustering can be formulated within the framework of nonnegative matrix factorization (NMF). We show that this framework yields NMF-based algorithms that are: (1) extremely simple to implement; (2) provably correct and provably convergent. We conduct a wide range of comparative experiments that demonstrate the effectiveness of this NMF-based approach.
Keywords :
matrix decomposition; pattern clustering; consensus clustering; nonnegative matrix factorization; semisupervised clustering problems; Clustering algorithms; Data analysis; Data mining; Engineering profession; Matrix decomposition; Partitioning algorithms; Robustness; Statistics; USA Councils;
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3018-5
DOI :
10.1109/ICDM.2007.98