DocumentCode
2500125
Title
Improving Performance of Network Traffic Classification Systems by Cleaning Training Data
Author
Gargiulo, Francesco ; Sansone, Carlo
Author_Institution
Dipt. di Inf. e Sist., Univ. degli Studi di Napoli Federico II, Naples, Italy
fYear
2010
fDate
23-26 Aug. 2010
Firstpage
2768
Lastpage
2771
Abstract
In this paper we propose to apply an algorithm for finding out and cleaning mislabeled training sample in an adversarial learning context, in which a malicious user tries to camouflage training patterns in order to limit the classification system performance. In particular, we describe how this algorithm can be effectively applied to the problem of identifying HTTP traffic flowing through port TCP 80, where mislabeled samples can be forced by using port-spoofing attacks.
Keywords
Internet; learning (artificial intelligence); pattern classification; security of data; HTTP traffic identification; TCP 80 port; adversarial learning context; mislabeled training sample cleaning; network traffic classification systems; port-spoofing attacks; training data cleaning; Accuracy; Cleaning; Context; Decision trees; Protocols; Training; Training data; Adversarial learning; Data Cleaning; Network Traffic Classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location
Istanbul
ISSN
1051-4651
Print_ISBN
978-1-4244-7542-1
Type
conf
DOI
10.1109/ICPR.2010.678
Filename
5597036
Link To Document