DocumentCode :
3182232
Title :
Pin-pointing concept descriptions
Author :
Sönströd, Cecilia ; Johansson, Ulf ; Boström, Henrik ; Norinder, Ulf
Author_Institution :
Sch. of Bus. & Inf., Univ. of Boras, Borås, Sweden
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
2956
Lastpage :
2963
Abstract :
In this study, the task of obtaining accurate and comprehensible concept descriptions of a specific set of production instances has been investigated. The suggested method, inspired by rule extraction and transductive learning, uses a highly accurate opaque model, called an oracle, to coach construction of transparent decision list models. The decision list algorithms evaluated are JRip and four different variants of Chipper, a technique specifically developed for concept description. Using 40 real-world data sets from the drug discovery domain, the results show that employing an oracle coach to label the production data resulted in significantly more accurate and smaller models for almost all techniques. Furthermore, augmenting normal training data with production data labeled by the oracle also led to significant increases in predictive performance, but with a slight increase in model size. Of the techniques evaluated, normal Chipper optimizing FOIL´s information gain and allowing conjunctive rules was clearly the best. The overall conclusion is that oracle coaching works very well for concept description.
Keywords :
data mining; decision making; learning (artificial intelligence); pharmaceutical industry; Chipper optimizing FOIL information; JRip; concept description; data augmentation; drug discovery domain; oracle coaching; pin pointing concept; rule extraction; transductive learning; transparent decision list model; Biology; Synthetic aperture sonar; Concept description; Data mining; Decision lists;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5641998
Filename :
5641998
Link To Document :
بازگشت