• DocumentCode
    3574473
  • Title

    Evolving clustering based data imputation

  • Author

    Gautam, Chandan ; Ravi, Vadlamani

  • Author_Institution
    SCIS, Univ. of Hyderabad, Hyderabad, India
  • fYear
    2014
  • Firstpage
    1763
  • Lastpage
    1769
  • Abstract
    Missing data is an inevitable problem in many disciplines. In this paper, we employed an Evolving Clustering Method (ECM) based imputation method and performed sensitivity analysis of the influence of threshold value (Dthr) on imputation results over 12 datasets. We experimented on a large range of Dthr values from 0.001 to 0.999, in steps of 0.001, in order to see which value of Dthr would perform better imputation compared to K-Means+MLP. Thereby, we provided an upper bound for the Dthr value in ECM algorithm. Further, we tested the effectiveness of the online clustering based imputation method on 12 datasets under 10-fold cross validation set up. ECM yielded better performance compared to K-Means + Multilayer perceptron hybrid algorithm, appearing in literature. It is due to strong local learning capability of ECM and selection of an optimal Dthr value.
  • Keywords
    learning (artificial intelligence); pattern clustering; ECM; clustering based data imputation; evolving clustering method; local learning capability; missing data; online clustering based imputation method; perceptron hybrid algorithm; sensitivity analysis; Banking; Clustering algorithms; Clustering methods; Computers; Electronic countermeasures; Iris; Sensitivity analysis; Evolving Clustering Method; Imputation; Local Learning; Missing Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on
  • Print_ISBN
    978-1-4799-2395-3
  • Type

    conf

  • DOI
    10.1109/ICCPCT.2014.7054988
  • Filename
    7054988