• DocumentCode
    3167121
  • Title

    Active Learning from Data Streams

  • Author

    Zhu, Xingquan ; Zhang, Peng ; Lin, Xiaodong ; Shi, Yong

  • Author_Institution
    Florida Atlantic Univ., Boca Raton
  • fYear
    2007
  • fDate
    28-31 Oct. 2007
  • Firstpage
    757
  • Lastpage
    762
  • Abstract
    In this paper, we address a new research problem on active learning from data streams where data volumes grow continuously and labeling all data is considered expensive and impractical. The objective is to label a small portion of stream data from which a model is derived to predict newly arrived instances as accurate as possible. In order to tackle the challenges raised by data streams´ dynamic nature, we propose a classifier ensembling based active learning framework which selectively labels instances from data streams to build an accurate classifier. A minimal variance principle is introduced to guide instance labeling from data streams. In addition, a weight updating rule is derived to ensure that our instance labeling process can adaptively adjust to dynamic drifting concepts in the data. Experimental results on synthetic and real-world data demonstrate the performances of the proposed efforts in comparison with other simple approaches.
  • Keywords
    data handling; pattern classification; classifier ensembling based active learning framework; data streams; instance labeling; minimal variance principle; Accuracy; Association rules; Computer science; Data engineering; Data mining; Decision making; Labeling; Predictive models; USA Councils; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
  • Conference_Location
    Omaha, NE
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3018-5
  • Type

    conf

  • DOI
    10.1109/ICDM.2007.101
  • Filename
    4470323