• DocumentCode
    2953787
  • Title

    Computational intelligence and decision trees for missing data estimation

  • Author

    Ssali, George ; Marwala, Tshilidzi

  • Author_Institution
    Sch. of Electr. Eng., Univ. of the Witwatersrand, Johannesburg
  • fYear
    2008
  • fDate
    1-8 June 2008
  • Firstpage
    201
  • Lastpage
    207
  • Abstract
    This paper introduces a novel paradigm to impute missing data that combines a decision tree with an auto-associative neural network (AANN) based model and a principal component analysis-neural network (PCA-NN) based model. For each model, the decision tree is used to predict search bounds for a genetic algorithm that minimise an error function derived from the respective model. The modelspsila ability to impute missing data is tested and compared using HIV sero-prevalance data. Results indicate an average increase in accuracy of 13% with the AANN based modelpsilas average accuracy increasing from 75.8% to 86.3% while that of the PCA-NN based model increasing from 66.1 % to 81.6%.
  • Keywords
    decision trees; estimation theory; genetic algorithms; minimisation; neural nets; principal component analysis; very large databases; auto-associative neural network; computational intelligence; decision tree; error function minimisation; genetic algorithm; large database; missing data estimation; missing data imputation; principal component analysis; search bound prediction; Africa; Computational intelligence; Databases; Decision trees; Genetic algorithms; Human immunodeficiency virus; Information analysis; Neural networks; Predictive models; Principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1820-6
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2008.4633790
  • Filename
    4633790