DocumentCode
3128768
Title
Drift Detection Using Uncertainty Distribution Divergence
Author
Lindstrom, Patrick ; Namee, Brian Mac ; Delany, Sarah Jane
Author_Institution
Sch. of Comput., Dublin Inst. of Technol., Dublin, Ireland
fYear
2011
fDate
11-11 Dec. 2011
Firstpage
604
Lastpage
608
Abstract
Concept drift is believed to be prevalent in most data gathered from naturally occurring processes and thus warrants research by the machine learning community. There are a myriad of approaches to concept drift handling which have been shown to handle concept drift with varying degrees of success. However, most approaches make the key assumption that the labelled data will be available at no labelling cost shortly after classification, an assumption which is often violated. The high labelling cost in many domains provides a strong motivation to reduce the number of labelled instances required to handle concept drift. Explicit detection approaches that do not require labelled instances to detect concept drift show great promise for achieving this. Our approach Confidence Distribution Batch Detection (CDBD) provides a signal correlated to changes in concept without using labelled data. We also show how this signal combined with a trigger and a rebuild policy can maintain classifier accuracy while using a limited amount of labelled data.
Keywords
data handling; learning (artificial intelligence); CDBD; concept drift; confidence distribution batch detection; drift detection; explicit detection; machine learning; uncertainty distribution divergence; Accuracy; Conferences; Data mining; Labeling; Machine learning; Training; Training data; classifier confidence; concept drift; explicit drift detection; labelling cost;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location
Vancouver, BC
Print_ISBN
978-1-4673-0005-6
Type
conf
DOI
10.1109/ICDMW.2011.70
Filename
6137435
Link To Document