• DocumentCode
    3716583
  • Title

    REODM: Identify Local Outliers in Big Data

  • Author

    Yongchang Gao;Haowen Guan;Bin Gong

  • Author_Institution
    Dept. of Comput. Sci. &
  • fYear
    2015
  • Firstpage
    825
  • Lastpage
    830
  • Abstract
    Outlier detection is now widely used in various fields. It attracts more and more interests in research. The density based outlier detection methods and the distance based outlier detection methods are the most frequently used outlier detection methods. In big data, the size and dimensions of data is very large. Those features make the conventional methods not suitable for big data. According to the features of big data, we propose a novel REODM method. Based on the rough set theory, we propose a relevant set, which is used to replace the universe set to improve the calculation speed. The REODM method is based on the ensemble learning. It is robust and accurate. We compare the performance of REODM, LOF and KNN methods. The experimental results show that the recall rate and precision of REODM is much better than LOF and KNN method.
  • Keywords
    "Big data","Set theory","Detection algorithms","Approximation methods","Robustness","Data mining","Medical diagnostic imaging"
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing (CIT/IUCC/DASC/PICOM), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/CIT/IUCC/DASC/PICOM.2015.122
  • Filename
    7363162