Title :
Active feature-value acquisition for classifier induction
Author :
Melville, Prem ; Saar-Tsechansky, Maytal ; Provost, Foster ; Mooney, Raymond
Author_Institution :
Dept. of Comput. Sci., Texas Univ., Austin, TX, USA
Abstract :
Many induction problems include missing data that can be acquired at a cost. For building accurate predictive models, acquiring complete information for all instances is often expensive or unnecessary, while acquiring information for a random subset of instances may not be most effective. Active feature-value acquisition tries to reduce the cost of achieving a desired model accuracy by identifying instances for which obtaining complete information is most informative. We present an approach in which instances are selected for acquisition based on the current model´s accuracy and its confidence in the prediction. Experimental results demonstrate that our approach can induce accurate models using substantially fewer feature-value acquisitions as compared to alternative policies.
Keywords :
classification; data acquisition; active feature-value acquisition; classifier induction; missing data; predictive model; Costs; Current measurement; Data mining; Design for experiments; Predictive models; Sampling methods;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10075