Title :
Applying feature selection to payload-based Web Application Firewalls
Author :
Torrano-Gimenez, Carmen ; Hai Thanh Nguyen ; Alvarez, Gabriel ; Petrovic, Slobodan ; Franke, Katrin
Author_Institution :
Inst. de Fis. Aplic., Consejo Super. de Investig. Cientificas, Madrid, Spain
Abstract :
Web Application Firewalls (WAFs) analyze the HTTP traffic in order to protect Web applications from attacks. To be effective, WAFs need to analyze the payload of the packets. One of the techniques used for intrusion detection is to extract features from the payload by means of n-grams. An n-gram is a subsequence of n items from a given sequence. The number of n-grams is 256 to the nth power. Since it grows exponentially with n, the curse of dimensionality and computational complexity problem arise. In this paper we propose to apply feature selection in order to reduce the number of features extracted by n-grams and thus to improve the effectiveness of WAFs. We conduct experiments on our own HTTP data set. After extracting n-grams from this data set, we apply the Generic-Feature-Selection (GeFS) measure for intrusion detection [5] to select important features. We use four different classifiers to test the detection accuracy before and after feature selection. The experiments show that we can remove more than 95% of irrelevant and redundant features from the original data set (and thus improve the performance by more than 80% on average), while reducing only slightly (by less than 6%) the accuracy of WAFs.
Keywords :
Internet; computational complexity; firewalls; telecommunication traffic; GeFS; HTTP traffic; WAF; computational complexity; data set; dimensionality; generic-feature-selection measure; intrusion detection; n-grams; payload-based Web application firewalls; Accuracy; Correlation; Data mining; Feature extraction; Intrusion detection; Optimization; Payloads; Web application firewall; Web attack detection; feature selection; intrusion detection systems; machine learning algorithms; n-grams;
Conference_Titel :
Security and Communication Networks (IWSCN), 2011 Third International Workshop on
Conference_Location :
Gjovik
DOI :
10.1109/IWSCN.2011.6827720