• DocumentCode
    1863759
  • Title

    An Mahalanobis distances based text clustering algorithm

  • Author

    Cuixia Li ; Yingjun Tan ; Jinsheng Kong

  • Author_Institution
    School of Software, Zhengzhou University, 450002, China
  • fYear
    2012
  • fDate
    3-5 March 2012
  • Firstpage
    465
  • Lastpage
    468
  • Abstract
    Though traditional fuzzy partitional text clustering algorithms are one of the most widely used methods, they are only fit to detect spherical structural clusters because they are based on Euclidean distances. When the data set has high dimensions, the accuracy and efficiency will decrease. Focus on solving this problem, a Fuzzy Mahalanobis distances based text clustering algorithm was proposed. Otherwise, finding eigenvalue and eigenvectors of a symmetric matrix or computing pseudoinvertion were used to avoid the singular values problem when finding Mahalanobis distances. The numerical experimental results show the validity of the proposed methods.
  • Keywords
    Euclidean distance; Fuzzy C-means; Mahalanobis distance; clustering; text clustering;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on
  • Conference_Location
    Xiamen
  • Electronic_ISBN
    978-1-84919-537-9
  • Type

    conf

  • DOI
    10.1049/cp.2012.1017
  • Filename
    6492624