DocumentCode :
2787398
Title :
MaxMin: An Unbiased Outlier Detection Method
Author :
Yan Li-rong ; Wu Yi-bo
Author_Institution :
Dept. of Inf., Wuhan Gen. Hosp. of Guangzhou Command, Wuhan, China
fYear :
2015
fDate :
24-26 April 2015
Firstpage :
297
Lastpage :
303
Abstract :
Purpose To study the outlier detection methods and supply practical tools for data mining. Methods A novel parametric outlier detection method called maximum minimum (MaxMin) was presented to solve the crucial outlier detection issue, i.e., the estimation of the distribution parameters of the target normal distribution ´contaminated´ with the outliers. Through an iterative rejection sampling procedure, a normal parametric probability density function was obtained to maximize the minimum ratio of nonparametric probability density to parametric probability density which is calculated based on each single sample. Simulation datasets, consisting of 2000 samples following the standard normal distribution contaminated by different portions of outlier following a Weibull distribution, were analysed to evaluate the performance of the MaxMin method. Results The simulation revealed that the distribution parameters could be precisely and stably estimated regardless of the proportion of the outliers in the whole dataset. The MaxMin procedure was mathematically demonstrated to be able to obtain the unbiased estimation of the distribution parameters when the outliers were not in the kernel region of the target distribution (e.g. [μ - σ, μ + σ]). Conclusion The unbiased estimation of the target distribution can be obtained by using MaxMin method under a relatively weak assumption. The outlier detection method has high precision and robustness.
Keywords :
data mining; probability; MaxMin method; Weibull distribution; data mining; iterative rejection sampling procedure; nonparametric probability density function; unbiased outlier detection method; Estimation; Gaussian distribution; Parameter estimation; Probability density function; Sociology; Standards; MaxMin; outlier detection; parametric method; unbiased estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Control Engineering (ICISCE), 2015 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4673-6849-0
Type :
conf
DOI :
10.1109/ICISCE.2015.73
Filename :
7120613
Link To Document :
بازگشت