Title :
Hellinger distance based drift detection for nonstationary environments
Author :
Ditzler, Gregory ; Polikar, Robi
Author_Institution :
Dept. of Electr. & Comput. Eng., Rowan Univ., Glassboro, NJ, USA
Abstract :
Most machine learning algorithms, including many online learners, assume that the data distribution to be learned is fixed. There are many real-world problems where the distribution of the data changes as a function of time. Changes in nonstationary data distributions can significantly reduce the generalization ability of the learning algorithm on new or field data, if the algorithm is not equipped to track such changes. When the stationary data distribution assumption does not hold, the learner must take appropriate actions to ensure that the new/relevant information is learned. On the other hand, data distributions do not necessarily change continuously, necessitating the ability to monitor the distribution and detect when a significant change in distribution has occurred. In this work, we propose and analyze a feature based drift detection method using the Hellinger distance to detect gradual or abrupt changes in the distribution.
Keywords :
data mining; generalisation (artificial intelligence); learning (artificial intelligence); Hellinger distance; data mining; drift detection; generalization ability; machine learning algorithm; nonstationary data distribution; nonstationary environment; Algorithm design and analysis; Current measurement; Detection algorithms; Histograms; Monitoring; Training; concept drift; drift detection; nonstationary environments;
Conference_Titel :
Computational Intelligence in Dynamic and Uncertain Environments (CIDUE), 2011 IEEE Symposium on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-9930-4
DOI :
10.1109/CIDUE.2011.5948491