DocumentCode :
2710133
Title :
A Conservative Feature Subset Selection Algorithm with Missing Data
Author :
Aussem, Alex ; de Morais, S.R.
Author_Institution :
Univ. de Lyon, Lyon
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
725
Lastpage :
730
Abstract :
This paper introduces a novel conservative feature subset selection method with incomplete data sets. The method is conservative in the sense that it selects the minimal subset of features that renders the rest of the features independent of the target (the class variable) without making any assumption about the missing data mechanism. This is achieved in the context of determining the Markov blanket of the target that reflects the worst-case assumption about the missing data mechanism, including the case when data is not missing at random. An application of the method on synthetic incomplete data is carried out to illustrate its practical relevance. The method is compared against state-of-the-art approaches such as the expectation maximization (EM) algorithm and the available case technique.
Keywords :
Markov processes; data mining; Markov blanket; conservative feature subset selection method; incomplete data sets; missing data; Bayesian methods; Data mining; Monte Carlo methods; Performance evaluation; Probability distribution; Robustness; Sampling methods; Spatial databases; Testing; Feature selection; bayesian networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location :
Pisa
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3502-9
Type :
conf
DOI :
10.1109/ICDM.2008.82
Filename :
4781169
Link To Document :
بازگشت