• DocumentCode
    2983188
  • Title

    GPU-Accelerated Feature Selection for Outlier Detection Using the Local Kernel Density Ratio

  • Author

    Azmandian, F. ; Yilmazer, Ayse ; Dy, Jennifer G. ; Aslam, Javed A. ; Kaeli, David R.

  • Author_Institution
    ECE Dept., Northeastern Univ., Boston, MA, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    51
  • Lastpage
    60
  • Abstract
    Effective outlier detection requires the data to be described by a set of features that captures the behavior of normal data while emphasizing those characteristics of outliers which make them different than normal data. In this work, we present a novel non-parametric evaluation criterion for filter-based feature selection which caters to outlier detection problems. The proposed method seeks the subset of features that represents the inherent characteristics of the normal dataset while forcing outliers to stand out, making them more easily distinguished by outlier detection algorithms. Experimental results on real datasets show the advantage of our feature selection algorithm compared to popular and state-of-the-art methods. We also show that the proposed algorithm is able to overcome the small sample space problem and perform well on highly imbalanced datasets. Furthermore, due to the highly parallelizable nature of the feature selection, we implement the algorithm on a graphics processing unit (GPU) to gain significant speedup over the serial version. The benefits of the GPU implementation are two-fold, as its performance scales very well in terms of the number of features, as well as the number of data points.
  • Keywords
    data mining; graphics processing units; GPU-accelerated feature selection; filter-based feature selection; graphics processing unit; imbalanced datasets; local kernel density ratio; nonparametric evaluation criterion; normal data; outlier detection problems; small sample space problem; Detection algorithms; Educational institutions; Feature extraction; Graphics processing units; Kernel; Search problems; USA Councils; Feature Selection; GPU Acceleration; Imbalanced Data; Outlier Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.51
  • Filename
    6413785