Title :
Distributional clustering using nonnegative matrix factorization
Author :
Zhu, Zhenfeng ; Ye, Yangdong
Author_Institution :
Sch. of Inf. Eng., Zhengzhou Univ., Zhengzhou, China
Abstract :
In this paper, we propose an iterative distributional clustering algorithm based on non-negative matrix factorization (DCMF). When factorizing a data matrix A into C×M, an objective function is defined to impose the conditional distribution constraints on the base matrix C and the coefficient matrix M. It has been observed that, in many applications, the conditional distributions of instances are often employed to normalize the data dimensions. Taking these factors into account, we simplify the existent updating rules and obtain the iterative algorithm DCMF. This algorithm satisfies the constraints described above on condition that the instance matrix is preprocessed as a conditional distribution. DCMF is simple, effective, and only needs to initialize the coefficient matrix. As a result, the base matrix can be viewed as a centroid matrix and the coefficient matrix just records the membership of fuzzy clustering. Compared with several other factorization algorithms, the experimental results on text, gene, and image data demonstrate that DCMF achieves 8.06% clustering accuracy improvement, 35.08% computational time reduction, and 61.30% hard clustering fuzziness decrease.
Keywords :
fuzzy set theory; iterative methods; matrix decomposition; pattern clustering; DCMF; base matrix; centroid matrix; coefficient matrix; conditional distribution constraints; data dimensions; data matrix; fuzzy clustering; instance matrix; iterative distributional clustering algorithm; nonnegative matrix factorization; Accuracy; Artificial neural networks; Clustering algorithms; Computational efficiency; Linear programming; Runtime; Uncertainty; Nonnegative matrix factorization; clustering; conditional distribution; fuzziness;
Conference_Titel :
Intelligent Control and Automation (WCICA), 2012 10th World Congress on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-1397-1
DOI :
10.1109/WCICA.2012.6359370