DocumentCode :
3574473
Title :
Evolving clustering based data imputation
Author :
Gautam, Chandan ; Ravi, Vadlamani
Author_Institution :
SCIS, Univ. of Hyderabad, Hyderabad, India
fYear :
2014
Firstpage :
1763
Lastpage :
1769
Abstract :
Missing data is an inevitable problem in many disciplines. In this paper, we employed an Evolving Clustering Method (ECM) based imputation method and performed sensitivity analysis of the influence of threshold value (Dthr) on imputation results over 12 datasets. We experimented on a large range of Dthr values from 0.001 to 0.999, in steps of 0.001, in order to see which value of Dthr would perform better imputation compared to K-Means+MLP. Thereby, we provided an upper bound for the Dthr value in ECM algorithm. Further, we tested the effectiveness of the online clustering based imputation method on 12 datasets under 10-fold cross validation set up. ECM yielded better performance compared to K-Means + Multilayer perceptron hybrid algorithm, appearing in literature. It is due to strong local learning capability of ECM and selection of an optimal Dthr value.
Keywords :
learning (artificial intelligence); pattern clustering; ECM; clustering based data imputation; evolving clustering method; local learning capability; missing data; online clustering based imputation method; perceptron hybrid algorithm; sensitivity analysis; Banking; Clustering algorithms; Clustering methods; Computers; Electronic countermeasures; Iris; Sensitivity analysis; Evolving Clustering Method; Imputation; Local Learning; Missing Data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuit, Power and Computing Technologies (ICCPCT), 2014 International Conference on
Print_ISBN :
978-1-4799-2395-3
Type :
conf
DOI :
10.1109/ICCPCT.2014.7054988
Filename :
7054988
Link To Document :
بازگشت