DocumentCode :
1840538
Title :
Hierarchical-Hyperspherical Divisive Fuzzy C-Means (H2D-FCM) Clustering for Information Retrieval
Author :
Bordogna, Gloria ; Pasi, Gabriella
Volume :
1
fYear :
2009
fDate :
15-18 Sept. 2009
Firstpage :
614
Lastpage :
621
Abstract :
In this paper an original soft hierarchical Fuzzy Clustering algorithm is proposed, named Hierarchical Hyper-spherical Divisive Fuzzy C-Means (H2D-FCM), with the following characteristics: it generates a “soft” hierarchy in which a document can belong to several child clusters of a node, and the clusters in the same hierarchical level are more specific (general) than the clusters in the upper (lower) level. The proposed algorithm is a divisive algorithm based on a modified bisective K-Means, applying a modified probabilistic Fuzzy C Means algorithm to divide each node into child-nodes. The algorithm determines the proper number of cluster to generate at the first level based on an entropy measure and decides if a node can be further split based on a “density” measure. The paper presents the algorithm and its evaluations on two standard collections.
Keywords :
Character generation; Clustering algorithms; Conferences; Content based retrieval; Density measurement; Entropy; Information filtering; Information retrieval; Intelligent agent; Unsupervised learning; Fuzzy C Means Algorithm; Fuzzy Hierarchical clustering; Vector Space Model.; documents clustering; unsupervised hierarchical categorization;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Milan, Italy
Print_ISBN :
978-0-7695-3801-3
Electronic_ISBN :
978-1-4244-5331-3
Type :
conf
DOI :
10.1109/WI-IAT.2009.104
Filename :
5284910
Link To Document :
بازگشت