DocumentCode
3716583
Title
REODM: Identify Local Outliers in Big Data
Author
Yongchang Gao;Haowen Guan;Bin Gong
Author_Institution
Dept. of Comput. Sci. &
fYear
2015
Firstpage
825
Lastpage
830
Abstract
Outlier detection is now widely used in various fields. It attracts more and more interests in research. The density based outlier detection methods and the distance based outlier detection methods are the most frequently used outlier detection methods. In big data, the size and dimensions of data is very large. Those features make the conventional methods not suitable for big data. According to the features of big data, we propose a novel REODM method. Based on the rough set theory, we propose a relevant set, which is used to replace the universe set to improve the calculation speed. The REODM method is based on the ensemble learning. It is robust and accurate. We compare the performance of REODM, LOF and KNN methods. The experimental results show that the recall rate and precision of REODM is much better than LOF and KNN method.
Keywords
"Big data","Set theory","Detection algorithms","Approximation methods","Robustness","Data mining","Medical diagnostic imaging"
Publisher
ieee
Conference_Titel
Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/CIT/IUCC/DASC/PICOM.2015.122
Filename
7363162
Link To Document