DocumentCode :
2540797
Title :
An outlier mining algorithm based on confidence interval
Author :
Zhang, Yue ; Yang, Xuehua ; Li, Hang
Author_Institution :
Software Coll., Shenyang Normal Univ., Shenyang, China
fYear :
2010
fDate :
16-18 April 2010
Firstpage :
231
Lastpage :
234
Abstract :
Outlier detection is a hot topic of data mining. After studying the existing classical algorithms of detecting outlier, this paper proposes an outlier mining algorithm based on confidence interval, and makes a new definition for outlier. The method combines mathematical statistics and density-based clustering algorithm. It clustering firstly with DBSCAN algorithm, obtains credible sample and suspicious outliers. Secondly, a confidence interval is obtained based on credible sample, then suspicious outliers will be detected and disposed using the confidence interval. The experiment results on IRIS show that this algorithm can detect outliers effectively.
Keywords :
data mining; pattern clustering; statistics; DBSCAN algorithm; confidence interval; data mining; density-based clustering algorithm; mathematical statistics; outlier detection; outlier mining algorithm; Algorithm design and analysis; Clustering algorithms; Data analysis; Data mining; Data warehouses; Educational institutions; Iris; Sampling methods; Software algorithms; Statistics; Clustering; Confidence Interval; Outlier; Stratified sampling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Management and Engineering (ICIME), 2010 The 2nd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5263-7
Electronic_ISBN :
978-1-4244-5265-1
Type :
conf
DOI :
10.1109/ICIME.2010.5477465
Filename :
5477465
Link To Document :
بازگشت