DocumentCode :
2953787
Title :
Computational intelligence and decision trees for missing data estimation
Author :
Ssali, George ; Marwala, Tshilidzi
Author_Institution :
Sch. of Electr. Eng., Univ. of the Witwatersrand, Johannesburg
fYear :
2008
fDate :
1-8 June 2008
Firstpage :
201
Lastpage :
207
Abstract :
This paper introduces a novel paradigm to impute missing data that combines a decision tree with an auto-associative neural network (AANN) based model and a principal component analysis-neural network (PCA-NN) based model. For each model, the decision tree is used to predict search bounds for a genetic algorithm that minimise an error function derived from the respective model. The modelspsila ability to impute missing data is tested and compared using HIV sero-prevalance data. Results indicate an average increase in accuracy of 13% with the AANN based modelpsilas average accuracy increasing from 75.8% to 86.3% while that of the PCA-NN based model increasing from 66.1 % to 81.6%.
Keywords :
decision trees; estimation theory; genetic algorithms; minimisation; neural nets; principal component analysis; very large databases; auto-associative neural network; computational intelligence; decision tree; error function minimisation; genetic algorithm; large database; missing data estimation; missing data imputation; principal component analysis; search bound prediction; Africa; Computational intelligence; Databases; Decision trees; Genetic algorithms; Human immunodeficiency virus; Information analysis; Neural networks; Predictive models; Principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
ISSN :
1098-7576
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
Type :
conf
DOI :
10.1109/IJCNN.2008.4633790
Filename :
4633790
Link To Document :
بازگشت