Title :
An expected utility approach to active feature-value acquisition
Author :
Melville, Prem ; Saar-Tsechansky, Maytal ; Provost, Foster ; Mooney, Raymond
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Austin, TX, USA
Abstract :
In many classification tasks, training data have missing feature values that can be acquired at a cost. For building accurate predictive models, acquiring all missing values is often prohibitively expensive or unnecessary, while acquiring a random subset of feature values may not be most effective. The goal of active feature-value acquisition is to incrementally select feature values that are most cost-effective for improving the model´s accuracy. We present an approach that acquires feature values for inducing a classification model based on an estimation of the expected improvement in model accuracy per unit cost. Experimental results demonstrate that our approach consistently reduces the cost of producing a model of a desired accuracy compared to random feature acquisitions.
Keywords :
data analysis; pattern classification; active feature-value acquisition; classification model; classification tasks; data training; predictive models; random feature acquisitions; Classification tree analysis; Costs; Decision trees; Demography; Measurement; Medical treatment; Predictive models; Testing; Training data; Utility theory;
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
Print_ISBN :
0-7695-2278-5
DOI :
10.1109/ICDM.2005.23