DocumentCode
3570941
Title
Outlier detection based on k-neighborhood MST
Author
Qingsheng Zhu ; Xiaogang Fan ; Ji Feng
Author_Institution
Chongqing Key Lab. of Software Theor. & Technol. Coll. of Comput. Sci., Chongqing Univ., Chongqing, China
fYear
2014
Firstpage
718
Lastpage
724
Abstract
Outlier detection is an important task in data mining. It is mainly used for finding strange mechanism or potential danger. This paper presents an outlier detection algorithm based on k-neighborhood minimum spanning tree(MST). This algorithm is applicable to data sets of any arbitrary shape and density and can effectively detect local outliers and local outlying clusters. Taking density and directional factor into consideration, this algorithm proposes a new dissimilarity measure based on k-neighborhood. Then a minimum spanning tree (MST) is built based on this k-neighborhood dissimilarity measure. Finally, the tree is progressively constrained to cutting so that the outliers can be found. Compared with algorithm LOF, COF, KNN and INFLO, the result proves the effectiveness and excellence of this new algorithm.
Keywords
data mining; trees (mathematics); data mining; dissimilarity measure; k-neighborhood MST; minimum spanning tree; outlier detection; Clustering algorithms; Data mining; Density measurement; Euclidean distance; Partitioning algorithms; Shape; Time complexity; Dissimilarity; MST; Outlier detection; Outlying cluster; k-neighborhood;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
Type
conf
DOI
10.1109/IRI.2014.7051960
Filename
7051960
Link To Document