DocumentCode :
3716583
Title :
REODM: Identify Local Outliers in Big Data
Author :
Yongchang Gao;Haowen Guan;Bin Gong
Author_Institution :
Dept. of Comput. Sci. &
fYear :
2015
Firstpage :
825
Lastpage :
830
Abstract :
Outlier detection is now widely used in various fields. It attracts more and more interests in research. The density based outlier detection methods and the distance based outlier detection methods are the most frequently used outlier detection methods. In big data, the size and dimensions of data is very large. Those features make the conventional methods not suitable for big data. According to the features of big data, we propose a novel REODM method. Based on the rough set theory, we propose a relevant set, which is used to replace the universe set to improve the calculation speed. The REODM method is based on the ensemble learning. It is robust and accurate. We compare the performance of REODM, LOF and KNN methods. The experimental results show that the recall rate and precision of REODM is much better than LOF and KNN method.
Keywords :
"Big data","Set theory","Detection algorithms","Approximation methods","Robustness","Data mining","Medical diagnostic imaging"
Publisher :
ieee
Conference_Titel :
Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
Type :
conf
DOI :
10.1109/CIT/IUCC/DASC/PICOM.2015.122
Filename :
7363162
Link To Document :
بازگشت