DocumentCode
2953787
Title
Computational intelligence and decision trees for missing data estimation
Author
Ssali, George ; Marwala, Tshilidzi
Author_Institution
Sch. of Electr. Eng., Univ. of the Witwatersrand, Johannesburg
fYear
2008
fDate
1-8 June 2008
Firstpage
201
Lastpage
207
Abstract
This paper introduces a novel paradigm to impute missing data that combines a decision tree with an auto-associative neural network (AANN) based model and a principal component analysis-neural network (PCA-NN) based model. For each model, the decision tree is used to predict search bounds for a genetic algorithm that minimise an error function derived from the respective model. The modelspsila ability to impute missing data is tested and compared using HIV sero-prevalance data. Results indicate an average increase in accuracy of 13% with the AANN based modelpsilas average accuracy increasing from 75.8% to 86.3% while that of the PCA-NN based model increasing from 66.1 % to 81.6%.
Keywords
decision trees; estimation theory; genetic algorithms; minimisation; neural nets; principal component analysis; very large databases; auto-associative neural network; computational intelligence; decision tree; error function minimisation; genetic algorithm; large database; missing data estimation; missing data imputation; principal component analysis; search bound prediction; Africa; Computational intelligence; Databases; Decision trees; Genetic algorithms; Human immunodeficiency virus; Information analysis; Neural networks; Predictive models; Principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location
Hong Kong
ISSN
1098-7576
Print_ISBN
978-1-4244-1820-6
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2008.4633790
Filename
4633790
Link To Document