Title :
A new reduction algorithm - difference-similitude matrix
Author :
Jiang, Hao ; Yan, Pu-Liu ; Xia, De-lin
Author_Institution :
Sch. of Electron. Inf., Wuhan Univ., China
Abstract :
In this paper, we present a new reduction algorithm. The main idea comes from showroom\´s discernibility matrix. Discernibility matrix only takes advantage of the difference of the condition attributes of the objects. But we also take the similitude of the condition attributes into account. By analyzing the difference and similitude of the conditional attributes of IS, the DSM (difference-similitude matrix) is formed. By defining the significance of the attributes and the uniformity of the objects, and analyzing the elements mijd & mijsin DSM, a new data reduction algorithm is put forward. The algorithm can obtain as minimal rules as possible while preserving the consistency of classifications. This reduction algorithm based on DSM is employed to analyze databases from UCI. Through comparing the reduction result of DSM algorithm and the discernibility matrix algorithm, it shows that DSM algorithm can obtain higher reduction rate of objects. Tested by "leave-one-out", DSM algorithm is proved to have higher correctness.
Keywords :
data reduction; information systems; rough set theory; data reduction algorithm; difference-similitude matrix; discernibility matrix algorithm; leave-one-out algorithm; reduction algorithm; rough set theory; Algorithm design and analysis; Classification algorithms; Cybernetics; Data analysis; Data mining; Databases; Information systems; Machine learning; Set theory; Testing;
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1259738