DocumentCode :
1863759
Title :
An Mahalanobis distances based text clustering algorithm
Author :
Cuixia Li ; Yingjun Tan ; Jinsheng Kong
Author_Institution :
School of Software, Zhengzhou University, 450002, China
fYear :
2012
fDate :
3-5 March 2012
Firstpage :
465
Lastpage :
468
Abstract :
Though traditional fuzzy partitional text clustering algorithms are one of the most widely used methods, they are only fit to detect spherical structural clusters because they are based on Euclidean distances. When the data set has high dimensions, the accuracy and efficiency will decrease. Focus on solving this problem, a Fuzzy Mahalanobis distances based text clustering algorithm was proposed. Otherwise, finding eigenvalue and eigenvectors of a symmetric matrix or computing pseudoinvertion were used to avoid the singular values problem when finding Mahalanobis distances. The numerical experimental results show the validity of the proposed methods.
Keywords :
Euclidean distance; Fuzzy C-means; Mahalanobis distance; clustering; text clustering;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Automatic Control and Artificial Intelligence (ACAI 2012), International Conference on
Conference_Location :
Xiamen
Electronic_ISBN :
978-1-84919-537-9
Type :
conf
DOI :
10.1049/cp.2012.1017
Filename :
6492624
Link To Document :
بازگشت