Title :
Adaptive Supervised Learning Model for Training Set Selection under Concept Drift Data Streams
Author :
Patil, Pramod D. ; Kulkarni, Parag
Author_Institution :
Dept. of Comput. Eng., Padmashree Dr.D.Y.Patil Inst. of Eng. & Technol., Pune, India
Abstract :
Dynamic changes are a part of everyday life. When there is a change in data, the classification models need to be adaptive to the changes. In this paper we propose adaptive supervised learning model for training set selection under concept drift data streams. This paper focuses on adaptive supervised learning techniques, where adaptivity to changes in data over time is achieved by selective training set methodology. These selective training set methods typically can be used plugging in various base classifiers. In this work we consider accuracy (generalization error) as the primary performance measure for concept drift learners. In this paper our research follows the three main drift types, starting from sudden drift, via gradual drift to reoccurring concepts. We give methodological contributions to concept drift phenomenon under real time application i.e. Electricity pricing contexts and expected change types. In this paper, a proposed methodology consist of four algorithms, first algorithm i.e. Optimal Window Resizing Algorithm under sudden drift to determine the optimal window length at a given time, identify to what extent a change point is different from the start of the training window and how this difference can be used to improve the accuracy of an adaptive learner. Second algorithm i.e. Gradual Drift algorithm which would unify two selection criteria: similarity in time and feature space to improve accuracy of an adaptive learner. Third algorithm i.e. Reoccurring Concept drift where previously seen patterns reoccur, but it is not certain when exactly and in what form they will repeat. Last algorithm i.e. Dynamic drift detection. In comparison to other methods, our proposed algorithms are faster and memory-less, a requirement for streaming applications. A proposed methodology is tested on Elec2 data, we get less error rate.
Keywords :
learning (artificial intelligence); pattern classification; Elec2 data; adaptive learner; adaptive supervised learning model; base classifiers; classification models; concept drift data streams; concept drift learners; dynamic drift detection; generalization error; gradual drift algorithm; optimal window resizing algorithm; primary performance measure; reoccurring concept drift; selection criteria; selective training set methodology; streaming applications; training set selection; training window; Accuracy; Classification algorithms; Heuristic algorithms; Mathematical model; Supervised learning; Testing; Training; Adaptive Training set; Concept Drift; Data streams; Supervised learning;
Conference_Titel :
Cloud & Ubiquitous Computing & Emerging Technologies (CUBE), 2013 International Conference on
Conference_Location :
Pune
Print_ISBN :
978-1-4799-2234-5
DOI :
10.1109/CUBE.2013.17