• DocumentCode
    243653
  • Title

    Drift Detection for Multi-label Data Streams Based on Label Grouping and Entropy

  • Author

    Zhongwei Shi ; Yimin Wen ; Chao Feng ; Hai Zhao

  • Author_Institution
    Sch. of Comput. Sci. & Eng., Guilin Univ. of Electron. Technol., Guilin, China
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    724
  • Lastpage
    731
  • Abstract
    Many real-world applications involve multi-label data streams, so effective concept drift detection methods should be able to consider the unique properties of multi-label stream data, such as label dependence. To deal with these challenges, we proposed an efficient and effective method to detect concept drift based on label grouping and entropy for multi-label data. Two methods are proposed to group the set of class labels into different subsets and a multi-label version of entropy was adjusted to measure the distribution of multi-label data. Concept drift was detected by comparing the entropies of the older and the most recent data. The experiments are run on three synthetic datasets and two real-world datasets and the experimental results illustrate the better classification performance of the proposed method for detecting drift.
  • Keywords
    data analysis; data mining; entropy; concept drift detection method; entropy; label grouping; multilabel data stream; Accuracy; Clustering algorithms; Computer aided software engineering; Data mining; Educational institutions; Entropy; Itemsets; concept drift; data stream; entropy; label dependence; multi-label;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.92
  • Filename
    7022667