DocumentCode
3527097
Title
A method for classification of network traffic based on C5.0 Machine Learning Algorithm
Author
Bujlow, Tomasz ; Riaz, Tahir ; Pedersen, Jens Myrup
Author_Institution
Dept. of Electron. Syst., Aalborg Univ., Aalborg East, Denmark
fYear
2012
fDate
Jan. 30 2012-Feb. 2 2012
Firstpage
237
Lastpage
241
Abstract
Monitoring of the network performance in highspeed Internet infrastructure is a challenging task, as the requirements for the given quality level are service-dependent. Backbone QoS monitoring and analysis in Multi-hop Networks requires therefore knowledge about types of applications forming current network traffic. To overcome the drawbacks of existing methods for traffic classification, usage of C5.0 Machine Learning Algorithm (MLA) was proposed. On the basis of statistical traffic information received from volunteers and C5.0 algorithm we constructed a boosted classifier, which was shown to have ability to distinguish between 7 different applications in test set of 76,632-1,622,710 unknown cases with average accuracy of 99.3-99.9%. This high accuracy was achieved by using high quality training data collected by our system, a unique set of parameters used for both training and classification, an algorithm for recognizing flow direction and the C5.0 itself. Classified applications include Skype, FTP, torrent, web browser traffic, web radio, interactive gaming and SSH. We performed subsequent tries using different sets of parameters and both training and classification options. This paper shows how we collected accurate traffic data, presents arguments used in classification process, introduces the C5.0 classifier and its options, and finally evaluates and compares the obtained results.
Keywords
Internet; learning (artificial intelligence); pattern classification; quality of service; telecommunication traffic; C5.0 machine learning algorithm; FTP; Internet infrastructure; SSH; Skype; Web browser traffic; Web radio; backbone QoS analysis; backbone QoS monitoring; boosted classifier; interactive gaming; multihop networks; network performance monitoring; network traffic classification method; torrent; Accuracy; Decision trees; Error analysis; Machine learning algorithms; Payloads; Quality of service; Training; C5.0; Machine Learning Algorithms (MLAs); computer networks; performance monitoring; traffic classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing, Networking and Communications (ICNC), 2012 International Conference on
Conference_Location
Maui, HI
Print_ISBN
978-1-4673-0008-7
Electronic_ISBN
978-1-4673-0723-9
Type
conf
DOI
10.1109/ICCNC.2012.6167418
Filename
6167418
Link To Document