DocumentCode
2220007
Title
Is machine learning losing the battle to produce transportable signatures against VoIP traffic?
Author
Alshammari, Riyad ; Zincir-Heywood, A. Nur
Author_Institution
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
fYear
2011
fDate
5-8 June 2011
Firstpage
1543
Lastpage
1550
Abstract
Traffic classification becomes more challenging since the traditional techniques such as port numbers or deep packet inspection are ineffective against voice over IP (VoIP) applications, which uses non-standard ports and encryption. Statistical information based on network layer with the use of machine learning (ML) can achieve high classification accuracy and produce transportable signatures. However, the ability of ML to find transportable signatures depends mainly on the training data sets. In this paper, we explore the importance of sampling training data sets for the ML algorithms, specifically Genetic Programming, C5.0, Naive Bayesian and AdaBoost, to find transportable signatures. To this end, we employed two techniques for sampling network training data sets, namely random sampling and consecutive sampling. Results show that random sampling and 90-minute consecutive sampling have the best performance in terms of accuracy using C5.0 and SBB, respectively. In terms of complexity, the size of C5.0 solutions increases as the training size increases, whereas SBB finds simpler solutions.
Keywords
Bayes methods; Internet telephony; genetic algorithms; learning (artificial intelligence); telecommunication security; telecommunication traffic; AdaBoost; C5.0; VoIP traffic classification; consecutive sampling; genetic programming; machine learning; naive Bayesian; random sampling; transportable signatures; voice over IP; Bayesian methods; Cryptography; Decision trees; Payloads; Protocols; Training; Training data; Classification; Encryption; Machine Learning; Sampling training data sets; Security; VoIP; transportability;
fLanguage
English
Publisher
ieee
Conference_Titel
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location
New Orleans, LA
ISSN
Pending
Print_ISBN
978-1-4244-7834-7
Type
conf
DOI
10.1109/CEC.2011.5949799
Filename
5949799
Link To Document