DocumentCode
2986751
Title
Outlier Mining Algorithm Based on Data-Partitioning and Density-Grid
Author
Chang Zheng Xing ; Cheng Long Tang ; Ke Wei
Author_Institution
Sch. of Electron. & Inf. Eng., Liaoning Tech. Univ., Huludao, China
fYear
2012
fDate
7-9 Dec. 2012
Firstpage
880
Lastpage
884
Abstract
Existing outlier mining algorithms such as FOMAUC are based on density-grid. These algorithms have the problems of inefficiency and bad-adaptability for various data sets, so this paper proposes an outlier mining algorithm based on data partitioning and grid-density. Firstly, the technology of data partitioning was applied. Secondly, the nonoutliers were filtered out by cell and the temporary results were saved. Thirdly, the improved CD-Tree was created to maintain the spatial information of the reserved data. After that, the nonoutliers were filtered out by micro-cell and were operated efficiently through two optimization strategies. Finally, followed by mining by data point the resulting outlier set was obtained. Theoretical analysis and the experimental results show that this method is feasible and effective, and that has better scalability for dealing with massive and high dimensional data.
Keywords
data mining; optimisation; CD-Tree; FOMAUC; data mining; data partitioning; data sets; density grid; grid-density; optimization strategies; outlier mining algorithm; spatial information; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Filtering; Optimization; Partitioning algorithms; cell; cell dimension tree(CD-Tree); data mining; data partitioning; density-grid; micro-cell; outlier data;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Engineering and Communication Technology (ICCECT), 2012 International Conference on
Conference_Location
Liaoning
Print_ISBN
978-1-4673-4499-9
Type
conf
DOI
10.1109/ICCECT.2012.34
Filename
6413973
Link To Document