• DocumentCode
    2500125
  • Title

    Improving Performance of Network Traffic Classification Systems by Cleaning Training Data

  • Author

    Gargiulo, Francesco ; Sansone, Carlo

  • Author_Institution
    Dipt. di Inf. e Sist., Univ. degli Studi di Napoli Federico II, Naples, Italy
  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    2768
  • Lastpage
    2771
  • Abstract
    In this paper we propose to apply an algorithm for finding out and cleaning mislabeled training sample in an adversarial learning context, in which a malicious user tries to camouflage training patterns in order to limit the classification system performance. In particular, we describe how this algorithm can be effectively applied to the problem of identifying HTTP traffic flowing through port TCP 80, where mislabeled samples can be forced by using port-spoofing attacks.
  • Keywords
    Internet; learning (artificial intelligence); pattern classification; security of data; HTTP traffic identification; TCP 80 port; adversarial learning context; mislabeled training sample cleaning; network traffic classification systems; port-spoofing attacks; training data cleaning; Accuracy; Cleaning; Context; Decision trees; Protocols; Training; Training data; Adversarial learning; Data Cleaning; Network Traffic Classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.678
  • Filename
    5597036