DocumentCode :
3724178
Title :
Towards Mining Trapezoidal Data Streams
Author :
Qin Zhang;Peng Zhang;Guodong Long;Wei Ding;Chengqi Zhang;Xindong Wu
Author_Institution :
QCIS, Univ. of Technol., Sydney, NSW, Australia
fYear :
2015
Firstpage :
1111
Lastpage :
1116
Abstract :
We study a new problem of learning from doubly-streaming data where both data volume and feature space increase over time. We refer to the problem as mining trapezoidal data streams. The problem is challenging because both data volume and feature space are increasing, to which existing online learning, online feature selection and streaming feature selection algorithms are inapplicable. We propose a new Sparse Trapezoidal Streaming Data mining algorithm (STSD) and its two variants which combine online learning and online feature selection to enable learning trapezoidal data streams with infinite training instances and features. Specifically, when new training instances carrying new features arrive, the classifier updates the existing features by following the passive-aggressive update rule used in online learning and updates the new features with the structural risk minimization principle. Feature sparsity is also introduced using the projected truncation techniques. Extensive experiments on the demonstrated UCI data sets show the performance of the proposed algorithms.
Keywords :
"Training","Heuristic algorithms","Algorithm design and analysis","Data mining","Prediction algorithms","Machine learning algorithms","Electronic mail"
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2015 IEEE International Conference on
ISSN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2015.42
Filename :
7373444
Link To Document :
بازگشت