DocumentCode
1863759
Title
An Mahalanobis distances based text clustering algorithm
Author
Cuixia Li ; Yingjun Tan ; Jinsheng Kong
Author_Institution
School of Software, Zhengzhou University, 450002, China
fYear
2012
fDate
3-5 March 2012
Firstpage
465
Lastpage
468
Abstract
Though traditional fuzzy partitional text clustering algorithms are one of the most widely used methods, they are only fit to detect spherical structural clusters because they are based on Euclidean distances. When the data set has high dimensions, the accuracy and efficiency will decrease. Focus on solving this problem, a Fuzzy Mahalanobis distances based text clustering algorithm was proposed. Otherwise, finding eigenvalue and eigenvectors of a symmetric matrix or computing pseudoinvertion were used to avoid the singular values problem when finding Mahalanobis distances. The numerical experimental results show the validity of the proposed methods.
Keywords
Euclidean distance; Fuzzy C-means; Mahalanobis distance; clustering; text clustering;
fLanguage
English
Publisher
iet
Conference_Titel
Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on
Conference_Location
Xiamen
Electronic_ISBN
978-1-84919-537-9
Type
conf
DOI
10.1049/cp.2012.1017
Filename
6492624
Link To Document