DocumentCode
2059944
Title
Improving signature detection classification model using features selection based on customized features
Author
Othman, Zulaiha Ali ; Bakar, Azuraliza Abu ; Etubal, Intesar
Author_Institution
Fac. of Inf. Sci. & Technol., Univ. Kebangsaan Malaysia (UKM), Bangi, Malaysia
fYear
2010
fDate
Nov. 29 2010-Dec. 1 2010
Firstpage
1026
Lastpage
1031
Abstract
Having an accurate Signature Detection Classification (SDC) Model has become highly demanding for Intrusion Detection Systems (IDS) to secure networks, especially when dealing with large and complex security audit data set. Selecting appropriate network features is one of the factors that influence the accuracy of SDC model. Past research has shown that the Hidden Marcov Chain, Genetic Algorithm, and the two-second time windows are among the best features selection methods for SDC Model. However this paper aims to improve the accuracy model by applying the features extraction based customized features. The customized features are the network data set which has been preprocessed through the following steps: removing biased attributes, discretized using chi-merge and remove the attributes with string value. The previous research applies the feature extraction based on all features. The best model is measured based on the detection rate, false alarm rate and number of rules using four data mining techniques such as Ripper(Jrip), Ridor, PART and decision three. The experiment is conducted using three random KDD-cup99 data sets. The result shows that the features extraction based on customized features has increased the accuracy model between 0.4% to 9% detection rates and reduced between 0.17% to 0.5% false alarm rates. The result shows the importance of data preprocessing in producing a high quality SDC Model.
Keywords
data mining; decision trees; digital signatures; feature extraction; pattern classification; security of data; KDD-cup99 data sets; PART; Ridor; Ripper(Jrip); biased attribute removal; customized features; data mining; data preprocessing; decision three; detection rate; false alarm rate; features extraction; features selection; intrusion detection system; network security; security audit data set; signature detection classification model; Data Mining; Features Selection; Genetic algorithm feature selection; JRip algorithm; Signature Detection;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
Conference_Location
Cairo
Print_ISBN
978-1-4244-8134-7
Type
conf
DOI
10.1109/ISDA.2010.5687051
Filename
5687051
Link To Document