Title :
An Outlier Detection Method Based on Clustering
Author :
Pamula, Rajendra ; Deka, Jatindra Kumar ; Nandi, Sukumar
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
Abstract :
In this paper we propose a clustering based method to capture outliers. We apply K-means clustering algorithm to divide the data set into clusters. The points which are lying near the centroid of the cluster are not probable candidate for outlier and we can prune out such points from each cluster. Next we calculate a distance based outlier score for remaining points. The computations needed to calculate the outlier score reduces considerably due to the pruning of some points. Based on the outlier score we declare the top n points with the highest score as outliers. The experimental results using real data set demonstrate that even though the number of computations is less, the proposed method performs better than the existing method.
Keywords :
pattern clustering; statistical analysis; K-means clustering algorithm; clustering based method; distance based outlier score; outlier detection method; Cancer; Clustering algorithms; Clustering methods; Data mining; Medical diagnosis; Spatial databases; Cluster; Outlier; distance-based;
Conference_Titel :
Emerging Applications of Information Technology (EAIT), 2011 Second International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-9683-9
DOI :
10.1109/EAIT.2011.25