Title :
Opening black box Data Mining models using Sensitivity Analysis
Author :
Cortez, Paulo ; Embrechts, Mark J.
Author_Institution :
Dept. of Inf. Syst., Univ. of Minho, Guimaräes, Portugal
Abstract :
There are several supervised learning Data Mining (DM) methods, such as Neural Networks (NN), Support Vector Machines (SVM) and ensembles, that often attain high quality predictions, although the obtained models are difficult to interpret by humans. In this paper, we open these black box DM models by using a novel visualization approach that is based on a Sensitivity Analysis (SA) method. In particular, we propose a Global SA (GSA), which extends the applicability of previous SA methods (e.g. to classification tasks), and several visualization techniques (e.g. variable effect characteristic curve), for assessing input relevance and effects on the model´s responses. We show the GSA capabilities by conducting several experiments, using a NN ensemble and SVM model, in both synthetic and real-world datasets.
Keywords :
data mining; neural nets; support vector machines; Global SA; black box; data mining models; ensembles; neural networks; sensitivity analysis; supervised learning; support vector machines; visualization techniques; Analytical models; Artificial neural networks; Delta modulation; Predictive models; Sensitivity; Support vector machines;
Conference_Titel :
Computational Intelligence and Data Mining (CIDM), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9926-7
DOI :
10.1109/CIDM.2011.5949423