DocumentCode
3574473
Title
Evolving clustering based data imputation
Author
Gautam, Chandan ; Ravi, Vadlamani
Author_Institution
SCIS, Univ. of Hyderabad, Hyderabad, India
fYear
2014
Firstpage
1763
Lastpage
1769
Abstract
Missing data is an inevitable problem in many disciplines. In this paper, we employed an Evolving Clustering Method (ECM) based imputation method and performed sensitivity analysis of the influence of threshold value (Dthr) on imputation results over 12 datasets. We experimented on a large range of Dthr values from 0.001 to 0.999, in steps of 0.001, in order to see which value of Dthr would perform better imputation compared to K-Means+MLP. Thereby, we provided an upper bound for the Dthr value in ECM algorithm. Further, we tested the effectiveness of the online clustering based imputation method on 12 datasets under 10-fold cross validation set up. ECM yielded better performance compared to K-Means + Multilayer perceptron hybrid algorithm, appearing in literature. It is due to strong local learning capability of ECM and selection of an optimal Dthr value.
Keywords
learning (artificial intelligence); pattern clustering; ECM; clustering based data imputation; evolving clustering method; local learning capability; missing data; online clustering based imputation method; perceptron hybrid algorithm; sensitivity analysis; Banking; Clustering algorithms; Clustering methods; Computers; Electronic countermeasures; Iris; Sensitivity analysis; Evolving Clustering Method; Imputation; Local Learning; Missing Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on
Print_ISBN
978-1-4799-2395-3
Type
conf
DOI
10.1109/ICCPCT.2014.7054988
Filename
7054988
Link To Document