Title :
Clustering data and imprecise concepts
Author :
Zhang, Weifeng ; Qin, Zengchang
Author_Institution :
Intell. Comput. & Machine Learning Lab., Beihang Univ., Beijing, China
Abstract :
Cluster analysis is the assignment of grouping a set of observations into clusters so that observations in the same cluster are similar in some sense. One of the key features for clustering is how to define a sensible similarity measure. However, classical clustering algorithms have no ability to cluster data instances and imprecise concepts using traditional distance measures. In this paper, we proposed a (dis)similarity measure based on a new knowledge representation framework called label semantics. Based on this new measure, we can automatically cluster data instance and descriptive concepts represented by logical expressions of linguistic labels. Experimental results on a toy problem in image classification demonstrate the effectiveness of the new proposed clustering algorithm. Since the new proposed measure can be extended to measuring distance between any two granularities, the new clustering algorithms can also be extended to clustering data instance and imprecise concepts represented by other granularities.
Keywords :
data mining; knowledge representation; pattern clustering; data clustering; distance measure; imprecise concept; knowledge representation; label semantics; linguistic label; logical expression; similarity measure; Algorithm design and analysis; Clustering algorithms; Humans; Image color analysis; Pragmatics; Semantics; Silicon; Clustering; Imprecise Concept Modeling; K-means; Label Semantics; Linguistic Expressions;
Conference_Titel :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7315-1
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2011.6007372