• DocumentCode
    671429
  • Title

    Concept drift detection for online class imbalance learning

  • Author

    Shuo Wang ; Minku, Leandro L. ; Ghezzi, Diego ; Caltabiano, Daniele ; Tino, Peter ; Xin Yao

  • Author_Institution
    Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
  • fYear
    2013
  • fDate
    4-9 Aug. 2013
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Concept drift detection methods are crucial components of many online learning approaches. Accurate drift detections allow prompt reaction to drifts and help to maintain high performance of online models over time. Although many methods have been proposed, no attention has been given to data streams with imbalanced class distributions, which commonly exist in real-world applications, such as fault diagnosis of control systems and intrusion detection in computer networks. This paper studies the concept drift problem for online class imbalance learning. We look into the impact of concept drift on single-class performance of online models based on three types of classifiers, under seven different scenarios with the presence of class imbalance. The analysis reveals that detecting drift in imbalanced data streams is a more difficult task than in balanced ones. Minority-class recall suffers from a significant drop after the drift involving the minority class. Overall accuracy is not suitable for drift detection. Based on the findings, we propose a new detection method DDM-OCI derived from the existing method DDM. DDM-OCI monitors minority-class recall online to capture the drift. The results show a quick response of the online model working with DDM-OCI to the new concept.
  • Keywords
    computer aided instruction; data analysis; DDM-OCI; accurate drift detections; computer networks; concept drift detection; concept drift problem; control systems; fault diagnosis; imbalanced class distributions; imbalanced data streams; intrusion detection; minority-class recall online; online class imbalance learning; online learning; online models; real-world applications; single-class performance; Accuracy; Bagging; Computational modeling; Decision trees; Detectors; Monitoring; Neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2013 International Joint Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-6128-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.2013.6706768
  • Filename
    6706768