• DocumentCode
    3570941
  • Title

    Outlier detection based on k-neighborhood MST

  • Author

    Qingsheng Zhu ; Xiaogang Fan ; Ji Feng

  • Author_Institution
    Chongqing Key Lab. of Software Theor. & Technol. Coll. of Comput. Sci., Chongqing Univ., Chongqing, China
  • fYear
    2014
  • Firstpage
    718
  • Lastpage
    724
  • Abstract
    Outlier detection is an important task in data mining. It is mainly used for finding strange mechanism or potential danger. This paper presents an outlier detection algorithm based on k-neighborhood minimum spanning tree(MST). This algorithm is applicable to data sets of any arbitrary shape and density and can effectively detect local outliers and local outlying clusters. Taking density and directional factor into consideration, this algorithm proposes a new dissimilarity measure based on k-neighborhood. Then a minimum spanning tree (MST) is built based on this k-neighborhood dissimilarity measure. Finally, the tree is progressively constrained to cutting so that the outliers can be found. Compared with algorithm LOF, COF, KNN and INFLO, the result proves the effectiveness and excellence of this new algorithm.
  • Keywords
    data mining; trees (mathematics); data mining; dissimilarity measure; k-neighborhood MST; minimum spanning tree; outlier detection; Clustering algorithms; Data mining; Density measurement; Euclidean distance; Partitioning algorithms; Shape; Time complexity; Dissimilarity; MST; Outlier detection; Outlying cluster; k-neighborhood;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration (IRI), 2014 IEEE 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/IRI.2014.7051960
  • Filename
    7051960