DocumentCode :
401690
Title :
A new reduction algorithm - difference-similitude matrix
Author :
Jiang, Hao ; Yan, Pu-Liu ; Xia, De-lin
Author_Institution :
Sch. of Electron. Inf., Wuhan Univ., China
Volume :
3
fYear :
2003
fDate :
2-5 Nov. 2003
Firstpage :
1533
Abstract :
In this paper, we present a new reduction algorithm. The main idea comes from showroom\´s discernibility matrix. Discernibility matrix only takes advantage of the difference of the condition attributes of the objects. But we also take the similitude of the condition attributes into account. By analyzing the difference and similitude of the conditional attributes of IS, the DSM (difference-similitude matrix) is formed. By defining the significance of the attributes and the uniformity of the objects, and analyzing the elements mijd & mijsin DSM, a new data reduction algorithm is put forward. The algorithm can obtain as minimal rules as possible while preserving the consistency of classifications. This reduction algorithm based on DSM is employed to analyze databases from UCI. Through comparing the reduction result of DSM algorithm and the discernibility matrix algorithm, it shows that DSM algorithm can obtain higher reduction rate of objects. Tested by "leave-one-out", DSM algorithm is proved to have higher correctness.
Keywords :
data reduction; information systems; rough set theory; data reduction algorithm; difference-similitude matrix; discernibility matrix algorithm; leave-one-out algorithm; reduction algorithm; rough set theory; Algorithm design and analysis; Classification algorithms; Cybernetics; Data analysis; Data mining; Databases; Information systems; Machine learning; Set theory; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
Type :
conf
DOI :
10.1109/ICMLC.2003.1259738
Filename :
1259738
Link To Document :
بازگشت