• DocumentCode
    2540797
  • Title

    An outlier mining algorithm based on confidence interval

  • Author

    Zhang, Yue ; Yang, Xuehua ; Li, Hang

  • Author_Institution
    Software Coll., Shenyang Normal Univ., Shenyang, China
  • fYear
    2010
  • fDate
    16-18 April 2010
  • Firstpage
    231
  • Lastpage
    234
  • Abstract
    Outlier detection is a hot topic of data mining. After studying the existing classical algorithms of detecting outlier, this paper proposes an outlier mining algorithm based on confidence interval, and makes a new definition for outlier. The method combines mathematical statistics and density-based clustering algorithm. It clustering firstly with DBSCAN algorithm, obtains credible sample and suspicious outliers. Secondly, a confidence interval is obtained based on credible sample, then suspicious outliers will be detected and disposed using the confidence interval. The experiment results on IRIS show that this algorithm can detect outliers effectively.
  • Keywords
    data mining; pattern clustering; statistics; DBSCAN algorithm; confidence interval; data mining; density-based clustering algorithm; mathematical statistics; outlier detection; outlier mining algorithm; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data warehouses; Educational institutions; Iris; Sampling methods; Software algorithms; Statistics; Clustering; Confidence Interval; Outlier; Stratified sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Management and Engineering (ICIME), 2010 The 2nd IEEE International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-5263-7
  • Electronic_ISBN
    978-1-4244-5265-1
  • Type

    conf

  • DOI
    10.1109/ICIME.2010.5477465
  • Filename
    5477465