• DocumentCode
    3457578
  • Title

    An Outlier Mining Algorithm in High-Dimension Based on Single-Parameter-k Local Density

  • Author

    Huang, WeiLi ; Wu, Di ; Ren, Jiadong

  • Author_Institution
    Dept. of Inf. & Electron. Eng., Hebei Univ. of Eng., Handan, China
  • fYear
    2009
  • fDate
    7-9 Dec. 2009
  • Firstpage
    1192
  • Lastpage
    1195
  • Abstract
    As one of the most important problems in data mining, many studies have been done on mining outliers. However, mining outliers in high-dimension has not been well addressed. In this paper, the concepts of reference radius and local deviation index are defined. A novel algorithm OMHKLD based on single-parameter-k local density in high-dimension for mining outliers is proposed. According to a new clustering algorithm KLDCA based on single-parameter-k local density, the data set is divided into outliers and cluster points. The cluster points are eliminated directly. The outlier candidate set is obtained. Moreover, take advantage of the idea of LOF, our algorithm indicates the degree of the objects in outlier candidate set with the local deviation index. The optimal outlier set can be gained. The experimental results and analysis show that the performance of OMHKLD is better than DBSCAN and LOF in improving the clustering quality and reducing memory usage and time cost.
  • Keywords
    data mining; pattern clustering; clustering algorithm KLDCA; data mining; local deviation index; optimal outlier set; outlier mining algorithm; reference radius; single-parameter-k local density; Clustering algorithms; Costs; Data engineering; Data mining; Databases; Educational institutions; Information science; Machine learning; Object detection; Performance analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovative Computing, Information and Control (ICICIC), 2009 Fourth International Conference on
  • Conference_Location
    Kaohsiung
  • Print_ISBN
    978-1-4244-5543-0
  • Type

    conf

  • DOI
    10.1109/ICICIC.2009.98
  • Filename
    5412402