DocumentCode
623962
Title
Self-learning classifier for Internet traffic
Author
Grimaudo, Luigi ; Mellia, Marco ; Baralis, Elena ; Keralapura, Ram
Author_Institution
Politec. di Torino, Turin, Italy
fYear
2013
fDate
14-19 April 2013
Firstpage
3381
Lastpage
3386
Abstract
Network visibility is a critical part of traffic engineering, network management, and security. Recently, unsupervised algorithms have been envisioned as a viable alternative to automatically identify classes of traffic. However, the accuracy achieved so far does not allow to use them for traffic classification in practical scenario. In this paper, we propose SeLeCT, a Self-Learning Classifier for Internet traffic. It uses unsupervised algorithms along with an adaptive learning approach to automatically let classes of traffic emerge, being identified and (easily) labeled. SeLeCT automatically groups flows into pure (or homogeneous) clusters using alternating simple clustering and filtering phases to remove outliers. SeLeCT uses an adaptive learning approach to boost its ability to spot new protocols and applications. Finally, SeLeCT also simplifies label assignment (which is still based on some manual intervention) so that proper class labels can be easily discovered. We evaluate the performance of SeLeCT using traffic traces collected in different years from various ISPs located in 3 different continents. Our experiments show that SeLeCT achieves overall accuracy close to 98%. Unlike state-of-art classifiers, the biggest advantage of SeLeCT is its ability to help discovering new protocols and applications in an almost automated fashion.
Keywords
Internet; information filtering; pattern classification; pattern clustering; protocols; telecommunication traffic; unsupervised learning; ISP; Internet service provider; Internet traffic; SeLeCT; adaptive learning approach; label assignment; network management; network security; network visibility; outlier removal; protocols; self-learning classifier; traffic classification; traffic engineering; traffic traces; unsupervised learning algorithms; Accuracy; Algorithm design and analysis; Clustering algorithms; Labeling; Ports (Computers); Protocols; Servers;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2013 Proceedings IEEE
Conference_Location
Turin
ISSN
0743-166X
Print_ISBN
978-1-4673-5944-3
Type
conf
DOI
10.1109/INFCOM.2013.6567168
Filename
6567168
Link To Document