Title :
Pool and Accuracy Based Stream Classification: A New Ensemble Algorithm on Data Stream Classification Using Recurring Concepts Detection
Author :
Hosseini, Mohammad Javad ; Ahmadi, Zahra ; Beigy, Hamid
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
One of the main challenges of data streams is the occurrence of concept drift. Concept drift is the change of target (or feature) distribution, and can occur in different types: sudden, gradual, incremental or recurring. Because of the forgetting mechanism existing in the data stream learning process, recurring concepts has received much attention recently, and became a challenging problem. This paper tries to exploit the existence of recurring concepts in the learning process and improve the classification of data streams. It uses a pool of concepts to detect the reoccurrence of a concept using two methods: a Bayesian, and a heuristic method. Two approaches are used in the classification process: active classifier and weighted classifier. Experimental results show the effectiveness of the proposed method with respect to the Conceptual Clustering and Prediction (CCP) framework.
Keywords :
belief networks; learning (artificial intelligence); pattern classification; Bayesian method; accuracy based stream classification; active classifier; conceptual clustering and prediction framework; data stream classification; ensemble algorithm; forgetting mechanism; heuristic method; learning process; pool based stream classification; recurring concepts detection; weighted classifier; Conferences; Data mining; concept drift; ensemble learning; recurring concepts; stream mining;
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
DOI :
10.1109/ICDMW.2011.137