DocumentCode
2771408
Title
An algorithm for computing attribute reducts based on graph search strategy
Author
Shi, Minghui ; Zhou, Changle ; Chao, Fei ; Jiang, Min
Author_Institution
Dept. of Cognitive Sci., Xiamen Univ., Xiamen, China
fYear
2012
fDate
10-15 June 2012
Firstpage
1
Lastpage
8
Abstract
Attribute reducts can discover previously unknown, non-trivial and useful abstractions from the data in large databases. However, many methods for finding attribute reducts from large data sets always meet a difficult problem of combination explosion. To overcome the problem and find some attribute reducts with high efficiency, the algorithm CARHS was proposed. The basic idea of CARHS is: 1) transform the problem into an equivalent one that searches paths, from which attribute reducts can be easily derived, from a graph; 2) employ high efficient heuristic rules during the course of depth-first search on the graph. By means of the heuristic rules, those paths that would not derive attribute reducts could be blocked as early as possible, furthermore, for those paths that would derive the same attribute reduct, only one of them could complete the course of search, and the others could be blocked as early as possible. Thus some attribute reducts could be found by CARHS with high efficiency even when dealing with huge data sets. The transformation of the problem, novel concepts, the heuristic search rules, and the algorithm CARHS were illustrated in detail by some examples. At last, The experiment on three classic UCI data sets showed the effect of the heuristic search rules and the efficiency of the algorithm CARHS.
Keywords
data mining; graph theory; learning (artificial intelligence); tree searching; very large databases; CARHS; attribute reducts; classic UCI data sets; depth-first search; graph search strategy; heuristic rules; heuristic search rules; large data sets; large databases; nontrivial abstractions; problem transformation; Data mining; Explosions; Heuristic algorithms; Information systems; Machine learning; Search problems; Transforms; attribute reduct; data mining; machine learning; reduct discernibility graph;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location
Brisbane, QLD
ISSN
2161-4393
Print_ISBN
978-1-4673-1488-6
Electronic_ISBN
2161-4393
Type
conf
DOI
10.1109/IJCNN.2012.6252484
Filename
6252484
Link To Document