Title :
Stream Classification with Recurring and Novel Class Detection Using Class-Based Ensemble
Author :
Al-Khateeb, T. ; Masud, M.M. ; Khan, Latifur ; Aggarwal, Charu ; Jiawei Han ; Thuraisingham, Bhavani
Author_Institution :
Dept. of Comput. Sc., Univ. of Texas at Dallas, Dallas, TX, USA
Abstract :
Concept-evolution has recently received a lot of attention in the context of mining data streams. Concept-evolution occurs when a new class evolves in the stream. Although many recent studies address this issue, most of them do not consider the scenario of recurring classes in the stream. A class is called recurring if it appears in the stream, disappears for a while, and then reappears again. Existing data stream classification techniques either misclassify the recurring class instances as another class, or falsely identify the recurring classes as novel. This increases the prediction error of the classifiers, and in some cases causes unnecessary waste in memory and computational resources. In this paper we address the recurring class issue by proposing a novel "class-based" ensemble technique, which substitutes the traditional "chunk-based" ensemble approaches and correctly distinguishes between a recurring class and a novel one. We analytically and experimentally confirm the superiority of our method over state-of-the-art techniques.
Keywords :
data mining; pattern classification; chunk- based ensemble approaches; class-based ensemble; class-based ensemble technique; classifier prediction error; computational resources; concept evolution; data stream classification techniques; data stream mining; memory waste; novel class detection; recurring class detection; state-of-the-art techniques; stream classification; Classification algorithms; Data models; Educational institutions; Electronic mail; Humans; Prediction algorithms; Training; novel class; recurring class; stream classification;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.125