Title :
Clustering-training for Data Stream Mining
Author :
Wu, Shuang ; Yang, Chunyu ; Zhou, Jie
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing
Abstract :
Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semi-supervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm
Keywords :
data mining; learning (artificial intelligence); pattern clustering; clustering-training; data set; data stream mining; semisupervised learning; unlabeled samples; Automation; Clustering algorithms; Data mining; Data processing; Databases; Labeling; Large-scale systems; Sampling methods; Semisupervised learning; Supervised learning;
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
DOI :
10.1109/ICDMW.2006.45