  • DocumentCode
    578331
  • Title
    Distributional clustering using nonnegative matrix factorization
  • Author
    Zhu, Zhenfeng; Ye, Yangdong
  • Author_Institution
    Sch. of Inf. Eng., Zhengzhou Univ., Zhengzhou, China
  • fYear
    2012
  • fDate
    6-8 July 2012
  • Firstpage
    4705
  • Lastpage
    4711
  • Abstract
    In this paper, we propose an iterative distributional clustering algorithm based on non-negative matrix factorization (DCMF). When factorizing a data matrix A into C×M, an objective function is defined to impose conditional distribution constraints on the base matrix C and the coefficient matrix M. It has been observed that, in many applications, the conditional distributions of instances are employed to normalize the data dimensions. Taking these factors into account, we simplify the existing update rules and obtain the iterative algorithm DCMF. The algorithm satisfies the constraints described above provided that the instance matrix is preprocessed into a conditional distribution. DCMF is simple and effective, and only the coefficient matrix needs to be initialized. As a result, the base matrix can be viewed as a centroid matrix, and the coefficient matrix records the fuzzy clustering memberships. Experimental results on text, gene, and image data show that, compared with several other factorization algorithms, DCMF improves clustering accuracy by 8.06%, reduces computation time by 35.08%, and decreases hard clustering fuzziness by 61.30%.
  • Keywords
    fuzzy set theory; iterative methods; matrix decomposition; pattern clustering; DCMF; base matrix; centroid matrix; coefficient matrix; conditional distribution constraints; data dimensions; data matrix; fuzzy clustering; instance matrix; iterative distributional clustering algorithm; nonnegative matrix factorization; Accuracy; Artificial neural networks; Clustering algorithms; Computational efficiency; Linear programming; Runtime; Uncertainty; Nonnegative matrix factorization; clustering; conditional distribution; fuzziness
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Intelligent Control and Automation (WCICA), 2012 10th World Congress on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4673-1397-1
  • Type
    conf
  • DOI
    10.1109/WCICA.2012.6359370
  • Filename
    6359370
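
The setting described in the abstract (a column-normalized data matrix A factorized as C×M, with the columns of C and M constrained to be conditional distributions) can be illustrated with a generic sketch. This is not the DCMF update rule, which the abstract does not state. It is an assumption-laden stand-in that uses the standard Kullback-Leibler multiplicative NMF updates with explicit column renormalization. It also initializes both factors, whereas the abstract says DCMF initializes only the coefficient matrix. The function name `kl_nmf_distributional` and its parameters are hypothetical.

```python
import numpy as np


def kl_nmf_distributional(A, k, n_iter=200, seed=0, eps=1e-12):
    """Generic KL-divergence NMF with column-stochastic factors.

    NOT the paper's DCMF rule: an illustrative stand-in (assumption).
    A : (n_features, n_instances) nonnegative matrix, no all-zero columns.
    Returns C (n_features, k) and M (k, n_instances), columns summing to 1.
    """
    rng = np.random.default_rng(seed)
    A = np.asarray(A, dtype=float)
    # Preprocess: turn every instance (column) into a conditional distribution.
    A = A / A.sum(axis=0, keepdims=True)
    n, m = A.shape

    # Random positive initialization, normalized column-wise.
    C = rng.random((n, k)) + eps
    C /= C.sum(axis=0, keepdims=True)
    M = rng.random((k, m)) + eps
    M /= M.sum(axis=0, keepdims=True)

    for _ in range(n_iter):
        # Coefficient update; the usual denominator (column sums of C) is 1.
        R = A / (C @ M + eps)
        M *= C.T @ R
        M /= M.sum(axis=0, keepdims=True)

        # Base update; the per-column denominator cancels on renormalization.
        R = A / (C @ M + eps)
        C *= R @ M.T
        C /= C.sum(axis=0, keepdims=True)

    return C, M


if __name__ == "__main__":
    data = np.random.default_rng(1).random((6, 10)) + 0.1
    C, M = kl_nmf_distributional(data, k=3, n_iter=50)
    # Hard cluster label of each instance: index of its largest membership.
    print(M.argmax(axis=0))
```

In this sketch, each column of C plays the role of a centroid (a distribution over features) and each column of M holds the soft cluster memberships of one instance, which mirrors the interpretation given in the abstract.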