DocumentCode :
178361
Title :
Efficient Active Novel Class Detection for Data Stream Classification
Author :
Bouguelia, M.-R. ; Belaid, Y. ; Belaid, A.
Author_Institution :
LORIA, Univ. de Lorraine, Vandoeuvre-les-Nancy, France
fYear :
2014
fDate :
24-28 Aug. 2014
Firstpage :
2826
Lastpage :
2831
Abstract :
One substantial aspect of data stream classification is the possible appearance of novel, unseen classes, which must be identified in order to avoid confusion with existing classes. Most existing techniques omit the detection of such new classes, and the problem is rarely addressed in the literature. We address this issue and propose an efficient method to identify novel class emergence in a multi-class data stream. The proposed method incrementally maintains a covered feature space of existing (known) classes. An incoming data point is designated as an "insider" or an "outsider" depending on whether it lies inside or outside the covered space area. An insider represents a possible instance of an existing class, while an outsider may be an instance of a possible novel class. The proposed method iteratively selects those insiders (resp. outsiders) that are more likely to be members of a novel (resp. an existing) class, and eventually distinguishes the actual novel and existing classes accurately. We show how to actively query the labels of the identified novel class instances that are most uncertain. The method also allows us to balance the rapidity of novelty detection against its efficiency. Experiments using real-world data demonstrate the effectiveness of our approach for both novel class detection and classification accuracy.
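The insider/outsider idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes the covered space is a union of fixed-radius balls around stored points, and it queries the label of every outsider rather than only the most uncertain instances. The names `CoveredSpace`, `process_stream`, `radius`, and `oracle` are hypothetical and chosen only for illustration.

```python
import math


class CoveredSpace:
    """Hypothetical coverage model: union of fixed-radius balls around stored points."""

    def __init__(self, radius):
        self.radius = radius
        self.centers = []  # list of (point, label) pairs seen so far

    def add(self, point, label):
        """Extend the covered space with a labeled point."""
        self.centers.append((tuple(point), label))

    def nearest(self, point):
        """Return (label, distance) of the closest stored point."""
        if not self.centers:
            return None, math.inf
        best = min(self.centers, key=lambda c: math.dist(c[0], point))
        return best[1], math.dist(best[0], point)

    def classify(self, point):
        """Designate a point as an insider (inside the covered space) or an outsider."""
        label, distance = self.nearest(point)
        if distance <= self.radius:
            return "insider", label
        return "outsider", None


def process_stream(stream, space, oracle):
    """Assumed policy: query the oracle for each outsider and add it to the covered space."""
    decisions = []
    for point in stream:
        status, label = space.classify(point)
        if status == "outsider":
            label = oracle(point)  # label query, e.g. a human annotator
            space.add(point, label)
        decisions.append((status, label))
    return decisions
```

In this sketch a larger `radius` labels fewer points as outsiders, so fewer labels are queried but a novel class close to a known one may go unnoticed. This is only a loose analogue of the trade-off mentioned in the abstract.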
Keywords :
pattern classification; query processing; support vector machines; active novel class detection; class classification accuracy; class detection accuracy; class emergence; class instances; class members; covered feature space; covered space area; incoming data point; insider data point; known classes; label query; multiclass data stream classification; novelty detection; outsider data point; unseen class appearance identification; Accuracy; Heuristic algorithms; Labeling; Learning systems; Manuals; Support vector machines; Topology;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location :
Stockholm
ISSN :
1051-4651
Type :
conf
DOI :
10.1109/ICPR.2014.487
Filename :
6977200