DocumentCode :
120674
Title :
A review on real time data stream classification and adapting to various concept drift scenarios
Author :
Dongre, Priyanka B. ; Malik, Latesh G.
Author_Institution :
Comput. Sci. & Eng., GHRCE, Nagpur, India
fYear :
2014
fDate :
21-22 Feb. 2014
Firstpage :
533
Lastpage :
537
Abstract :
Data streams are viewed as a sequence of relational tuples (e.g., sensor readings,call records, web page visits) that continuously arrive at time-varying and possibly unbound streams. These data streams are potentially huge in size and thus it is impossible to process many data mining techniques and approaches. Classification techniques fail to successfully process data streams because of two factors: their overwhelming volume and their distinctive feature known as concept drift. Concept drift is a term used to describe changes in the learned structure that occur over time. The occurance of concept drift leads to a drastic drop in classification accuracy. The recognition of concept drift in data streams has led to sliding-window approaches also different approaches to mining data streams with concept drift include instance selection methods, drift detection, ensemble classifiers, option trees and using Hoeffding boundaries to estimate classifier performance. This paper describes the various types of concept drifts that affect the data examples and discusses various approaches in order to handle concept drift scenarios. The aim of this paper is to review and compare single classifier and ensemble approaches to data stream mining respectively.
Keywords :
data mining; estimation theory; pattern classification; time-varying systems; trees (mathematics); Hoeffding boundary; Web page visits; call records; classification accuracy; classification techniques; classifier performance estimation; concept drift scenarios; data mining techniques; data stream mining; drastic drop; drift detection; ensemble classifier; instance selection method; option trees; possibly unbound streams; real time data stream classification; relational tuples; sensor readings; sliding-window approach; time-varying stream; Accuracy; Algorithm design and analysis; Bagging; Classification algorithms; Conferences; Data mining; Heuristic algorithms; Data stream mining; concept drift; dynamic environment; ensemble classifier;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advance Computing Conference (IACC), 2014 IEEE International
Conference_Location :
Gurgaon
Print_ISBN :
978-1-4799-2571-1
Type :
conf
DOI :
10.1109/IAdCC.2014.6779381
Filename :
6779381
Link To Document :
بازگشت