DocumentCode :
2866224
Title :
An expected utility approach to active feature-value acquisition
Author :
Melville, Prem ; Saar-Tsechansky, Maytal ; Provost, Foster ; Mooney, Raymond
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Austin, TX, USA
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
In many classification tasks, training data have missing feature values that can be acquired at a cost. For building accurate predictive models, acquiring all missing values is often prohibitively expensive or unnecessary, while acquiring a random subset of feature values may not be most effective. The goal of active feature-value acquisition is to incrementally select feature values that are most cost-effective for improving the model´s accuracy. We present an approach that acquires feature values for inducing a classification model based on an estimation of the expected improvement in model accuracy per unit cost. Experimental results demonstrate that our approach consistently reduces the cost of producing a model of a desired accuracy compared to random feature acquisitions.
Keywords :
data analysis; pattern classification; active feature-value acquisition; classification model; classification tasks; data training; predictive models; random feature acquisitions; Classification tree analysis; Costs; Decision trees; Demography; Measurement; Medical treatment; Predictive models; Testing; Training data; Utility theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.23
Filename :
1565772
Link To Document :
بازگشت