DocumentCode
2540797
Title
An outlier mining algorithm based on confidence interval
Author
Zhang, Yue ; Yang, Xuehua ; Li, Hang
Author_Institution
Software Coll., Shenyang Normal Univ., Shenyang, China
fYear
2010
fDate
16-18 April 2010
Firstpage
231
Lastpage
234
Abstract
Outlier detection is a hot topic of data mining. After studying the existing classical algorithms of detecting outlier, this paper proposes an outlier mining algorithm based on confidence interval, and makes a new definition for outlier. The method combines mathematical statistics and density-based clustering algorithm. It clustering firstly with DBSCAN algorithm, obtains credible sample and suspicious outliers. Secondly, a confidence interval is obtained based on credible sample, then suspicious outliers will be detected and disposed using the confidence interval. The experiment results on IRIS show that this algorithm can detect outliers effectively.
Keywords
data mining; pattern clustering; statistics; DBSCAN algorithm; confidence interval; data mining; density-based clustering algorithm; mathematical statistics; outlier detection; outlier mining algorithm; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data warehouses; Educational institutions; Iris; Sampling methods; Software algorithms; Statistics; Clustering; Confidence Interval; Outlier; Stratified sampling;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Management and Engineering (ICIME), 2010 The 2nd IEEE International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-5263-7
Electronic_ISBN
978-1-4244-5265-1
Type
conf
DOI
10.1109/ICIME.2010.5477465
Filename
5477465
Link To Document