DocumentCode :
3620792
Title :
Finding reducts without building the discernibility matrix
Author :
M. Korzen;S. Jaroszewicz
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Tech. Univ. Szczecin, Poland
fYear :
2005
fDate :
6/27/1905 12:00:00 AM
Firstpage :
450
Lastpage :
455
Abstract :
We present algorithms for fast generation of short reducts which avoid building the discernibility matrix explicitly. We show how information obtained from this matrix can be obtained based only on the distributions of attribute values. Since the size of discernibility matrix is quadratic in the number of data records, not building the matrix explicitly gives a very significant speedup and makes it possible to find reducts even in very large databases. Algorithms are given for both absolute and relative reducts. Experiments show that our approach outperforms other reduct finding algorithms. Furthermore it is shown that many heuristic reduct finding algorithms using the discernibility matrix in fact select attributes based on their Gini index. A new definition of conditional Gini index is presented, motivated by reduct finding heuristics.
Keywords :
"Information systems","Databases","Heuristic algorithms","Computer science","Intelligent systems"
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications, 2005. ISDA ´05. Proceedings. 5th International Conference on
Print_ISBN :
0-7695-2286-6
Type :
conf
DOI :
10.1109/ISDA.2005.45
Filename :
1578826
Link To Document :
بازگشت