DocumentCode
2611918
Title
An Approach for Treatment of the Incomplete Data Based on WaveCluster and Weighted 1-Nearest Neighbor
Author
Li, Xingyi ; Lu, Junyun ; Shi, Huaji ; Ma, Suqin
Author_Institution
Sch. of Comput. Sci. & Telecommun. Eng., Jiangsu Univ., Zhenjiang, China
fYear
2009
fDate
17-20 April 2009
Firstpage
3
Lastpage
8
Abstract
For the incomplete data that usually exists in the process of pretreatment, this article presents an approach for treatment of the incomplete data based on WaveCluster and weighted 1-Nearest Neighbor (1-NN).The proposed method firstly carries out the WaveCluster in the complete record set of the whole set, which can reduce the volume of comparative data and rule out outliers, improve computational efficiency of the algorithm and the clustering accuracy. Then, the weighted 1-NN method is used, according to the contribution attributes made to the classification in the algorithm, the information gain of attribute is calculated and each attribute is endowed with certain weight using in the nearest neighbor measure, thus it can enhance the filling precision of the missing value. Experimental results show the proposed method is an appropriate and effective method in treatment of the incomplete data.
Keywords
data handling; pattern clustering; WaveCluster; attribute information gain; clustering accuracy; comparative data volume; incomplete data treatment; pretreatment process; weighted 1-nearest neighbor; Classification algorithms; Clustering algorithms; Computational efficiency; Computer science; Data engineering; Data mining; Filling; Gain measurement; Nearest neighbor searches; Springs; 1-nearest neighbor; WaveCluster; incomplete data; information gain;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology - Spring Conference, 2009. IACSITSC '09. International Association of
Conference_Location
Singapore
Print_ISBN
978-0-7695-3653-8
Type
conf
DOI
10.1109/IACSIT-SC.2009.38
Filename
5169300
Link To Document