• DocumentCode
    2265064
  • Title

    An Active Learning Method for Mining Time-Changing Data Streams

  • Author

    Huang, Shucheng

  • Author_Institution
    Sch. of Comput. Sci. & Eng., JiangSu Univ. of Sci. & Technol., Zhenjiang
  • Volume
    2
  • fYear
    2008
  • fDate
    20-22 Dec. 2008
  • Firstpage
    548
  • Lastpage
    552
  • Abstract
    Many applications generate continuous, time-changing data streams. Mining it for an adaptive classifier is of great interest and challenge. Many previous efforts impractically assume the labeled data is available and can be mined at anytime. In this paper, we propose an effective active learning method to mine time-changing data streams efficiently. It designs a way to monitoring the possible changes on the fly without need knowing the labeled data. Upon the suspected changes are indicated, it employs a light-weight uncertainty sampling algorithm to choose the most informative instances to label. With these representative labeled instances, it tests the significance of the suspected changes. If the changes indeed cause significant performance deterioration of the current classifier, it reconstructs the old model. Thus, our method can reliably detect significant changes, quickly adapt to concept-drift, and result effective models. Experimental results from real-world data confirm the advantages of our method.
  • Keywords
    data mining; database management systems; learning (artificial intelligence); pattern classification; active learning method; adaptive classifying model; light-weight uncertainty sampling algorithm; representative labeled instance; time-changing data stream mining; Application software; Costs; Data mining; Decision trees; Frequency; Learning systems; Monitoring; Sampling methods; Statistical distributions; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3497-8
  • Type

    conf

  • DOI
    10.1109/IITA.2008.136
  • Filename
    4739824