DocumentCode
2334543
Title
Indiscernibility degree of objects for evaluating simplicity of knowledge in the clustering procedure
Author
Hirano, Shoji ; Tsumoto, Shusaku
Author_Institution
Dept. of Med. Informatics, Shimane Med. Univ., Izumo, Japan
fYear
2001
fDate
2001
Firstpage
211
Lastpage
217
Abstract
The paper presents a novel, rough set-based clustering method that enables the evaluation of classification knowledge simplicity during the clustering procedure. The method iteratively refines equivalence relations so that they become a more simple set of relations that give adequate coarse classification to the objects. At each step of the iteration, the importance of the equivalence relation is evaluated on the basis of the newly introduced measure, indiscernibility degree. An indiscernibility degree is defined as a ratio of equivalence relations that classify the two objects into the same equivalence class. If an equivalence relation has the ability to discern two objects that have a high indiscernibility degree, a very fine classification is performed and then modified to regard them as indiscernible objects. The refinement is repeated, decreasing the threshold level of indiscernibility degree, and finally simple clusters can be obtained. Experimental results on the artificial data shows that iterative refinement of equivalence relation leads to successful generation of coarse clusters that can be represented by simple knowledge
Keywords
data mining; equivalence classes; pattern clustering; rough set theory; very large databases; artificial data; classification knowledge simplicity evaluation; clustering procedure; coarse classification; coarse clusters; equivalence class; equivalence relation; equivalence relations; indiscernibility degree; iterative refinement; rough set-based clustering method; simple clusters; threshold level; Algorithm design and analysis; Biomedical informatics; Clustering algorithms; Clustering methods; Data analysis; Databases; Rough sets; Scalability; Sections; Set theory;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
0-7695-1119-8
Type
conf
DOI
10.1109/ICDM.2001.989521
Filename
989521
Link To Document