DocumentCode :
2957055
Title :
Visualization of differences between rules´ syntactic and semantic similarities
Author :
Tsumoto, Shusaku
Author_Institution :
Dept. of Med. Informatics, Shimane Univ., Japan
Volume :
4
fYear :
2005
fDate :
10-12 Oct. 2005
Firstpage :
3588
Abstract :
One of the most important problems with rule induction methods is that it is very difficult for domain experts to check millions of rules generated from large datasets. The discovery from these rules requires deep interpretation from domain knowledge. Although several solutions have been proposed in the studies on data mining and knowledge discovery, these studies are not focused on similarities between rules obtained. When one rule r1 has reasonable features and the other rule r2 with high similarity to r1 includes unexpected factors, the relations between these rules will become a trigger to the discovery of knowledge. In this paper, we propose a visualization approach to show the similarity relations between rules based on multidimensional scaling, which assign a two-dimensional cartesian coordinate to each data point from the information about similarities between this data and others data. We evaluated this method on two medical data sets, whose experimental results show that knowledge useful for domain experts could be found.
Keywords :
data mining; data visualisation; knowledge based systems; 2D cartesian coordinate; data mining; domain knowledge; knowledge discovery; medical data sets; multidimensional scaling; rule induction; rules syntactic similarity; semantic similarity; visualization approach; Biomedical informatics; Bone diseases; Cities and towns; Data analysis; Data mining; Data visualization; Diabetes; Induction generators; Liver diseases; Microorganisms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2005 IEEE International Conference on
Print_ISBN :
0-7803-9298-1
Type :
conf
DOI :
10.1109/ICSMC.2005.1571704
Filename :
1571704
Link To Document :
بازگشت