Title :
An Outlier Mining Algorithm in High-Dimension Based on Single-Parameter-k Local Density
Author :
Huang, WeiLi ; Wu, Di ; Ren, Jiadong
Author_Institution :
Dept. of Inf. & Electron. Eng., Hebei Univ. of Eng., Handan, China
Abstract :
As one of the most important problems in data mining, many studies have been done on mining outliers. However, mining outliers in high-dimension has not been well addressed. In this paper, the concepts of reference radius and local deviation index are defined. A novel algorithm OMHKLD based on single-parameter-k local density in high-dimension for mining outliers is proposed. According to a new clustering algorithm KLDCA based on single-parameter-k local density, the data set is divided into outliers and cluster points. The cluster points are eliminated directly. The outlier candidate set is obtained. Moreover, take advantage of the idea of LOF, our algorithm indicates the degree of the objects in outlier candidate set with the local deviation index. The optimal outlier set can be gained. The experimental results and analysis show that the performance of OMHKLD is better than DBSCAN and LOF in improving the clustering quality and reducing memory usage and time cost.
Keywords :
data mining; pattern clustering; clustering algorithm KLDCA; data mining; local deviation index; optimal outlier set; outlier mining algorithm; reference radius; single-parameter-k local density; Clustering algorithms; Costs; Data engineering; Data mining; Databases; Educational institutions; Information science; Machine learning; Object detection; Performance analysis;
Conference_Titel :
Innovative Computing, Information and Control (ICICIC), 2009 Fourth International Conference on
Conference_Location :
Kaohsiung
Print_ISBN :
978-1-4244-5543-0
DOI :
10.1109/ICICIC.2009.98