DocumentCode :
3606897
Title :
Ensemble of distributed learners for online classification of dynamic data streams
Author :
Canzian, Luca ; Yu Zhang ; Van der Schaar, Mihaela
Author_Institution :
Qascom, Bassano del Grappa, Italy
Volume :
1
Issue :
3
fYear :
2015
Firstpage :
180
Lastpage :
194
Abstract :
We present a distributed online learning scheme to classify data captured from distributed and dynamic data sources. Our scheme consists of multiple distributed local learners, which analyze different streams of data that are correlated to a common event that needs to be classified. Each learner uses a local classifier to make a local prediction. The local predictions are then collected by each learner and combined using a weighted majority rule to output the final prediction. We propose a novel online ensemble learning algorithm to update the aggregation rule in order to adapt to the underlying data dynamics. We rigorously determine an upper bound for the worst-case mis-classification probability of our algorithm, which tends asymptotically to 0 if the misclassification probability of the best (unknown) static aggregation rule is 0. Then we extend our algorithm to address challenges specific to the distributed implementation and prove new bounds that apply to these settings. Finally, we test our scheme by performing an evaluation study on several data sets.
Keywords :
data handling; distributed processing; learning (artificial intelligence); pattern classification; data dynamics; distributed learners; distributed online learning; dynamic data sources; dynamic data streams; multiple distributed local learners; online classification; online ensemble learning algorithm; Distributed databases; Heuristic algorithms; Indexes; Information processing; Prediction algorithms; Pulse width modulation; Training; Big data; Online learning; big data; classification; concept drift; distributed learning; dynamic streams; ensemble of classifiers; machine learning; online learning;
fLanguage :
English
Journal_Title :
Signal and Information Processing over Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
2373-776X
Type :
jour
DOI :
10.1109/TSIPN.2015.2470125
Filename :
7274771
Link To Document :
بازگشت