• DocumentCode
    2220007
  • Title

    Is machine learning losing the battle to produce transportable signatures against VoIP traffic?

  • Author

    Alshammari, Riyad ; Zincir-Heywood, A. Nur

  • Author_Institution
    Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
  • fYear
    2011
  • fDate
    5-8 June 2011
  • Firstpage
    1543
  • Lastpage
    1550
  • Abstract
    Traffic classification becomes more challenging since the traditional techniques such as port numbers or deep packet inspection are ineffective against voice over IP (VoIP) applications, which uses non-standard ports and encryption. Statistical information based on network layer with the use of machine learning (ML) can achieve high classification accuracy and produce transportable signatures. However, the ability of ML to find transportable signatures depends mainly on the training data sets. In this paper, we explore the importance of sampling training data sets for the ML algorithms, specifically Genetic Programming, C5.0, Naive Bayesian and AdaBoost, to find transportable signatures. To this end, we employed two techniques for sampling network training data sets, namely random sampling and consecutive sampling. Results show that random sampling and 90-minute consecutive sampling have the best performance in terms of accuracy using C5.0 and SBB, respectively. In terms of complexity, the size of C5.0 solutions increases as the training size increases, whereas SBB finds simpler solutions.
  • Keywords
    Bayes methods; Internet telephony; genetic algorithms; learning (artificial intelligence); telecommunication security; telecommunication traffic; AdaBoost; C5.0; VoIP traffic classification; consecutive sampling; genetic programming; machine learning; naive Bayesian; random sampling; transportable signatures; voice over IP; Bayesian methods; Cryptography; Decision trees; Payloads; Protocols; Training; Training data; Classification; Encryption; Machine Learning; Sampling training data sets; Security; VoIP; transportability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2011 IEEE Congress on
  • Conference_Location
    New Orleans, LA
  • ISSN
    Pending
  • Print_ISBN
    978-1-4244-7834-7
  • Type

    conf

  • DOI
    10.1109/CEC.2011.5949799
  • Filename
    5949799