DocumentCode :
2774817
Title :
An Outlier Detection Method Based on Clustering
Author :
Pamula, Rajendra ; Deka, Jatindra Kumar ; Nandi, Sukumar
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear :
2011
fDate :
19-20 Feb. 2011
Firstpage :
253
Lastpage :
256
Abstract :
In this paper we propose a clustering based method to capture outliers. We apply K-means clustering algorithm to divide the data set into clusters. The points which are lying near the centroid of the cluster are not probable candidate for outlier and we can prune out such points from each cluster. Next we calculate a distance based outlier score for remaining points. The computations needed to calculate the outlier score reduces considerably due to the pruning of some points. Based on the outlier score we declare the top n points with the highest score as outliers. The experimental results using real data set demonstrate that even though the number of computations is less, the proposed method performs better than the existing method.
Keywords :
pattern clustering; statistical analysis; K-means clustering algorithm; clustering based method; distance based outlier score; outlier detection method; Cancer; Clustering algorithms; Clustering methods; Data mining; Medical diagnosis; Spatial databases; Cluster; Outlier; distance-based;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Emerging Applications of Information Technology (EAIT), 2011 Second International Conference on
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-9683-9
Type :
conf
DOI :
10.1109/EAIT.2011.25
Filename :
5734938
Link To Document :
بازگشت