• DocumentCode
    2059944
  • Title

    Improving signature detection classification model using features selection based on customized features

  • Author

    Othman, Zulaiha Ali ; Bakar, Azuraliza Abu ; Etubal, Intesar

  • Author_Institution
    Fac. of Inf. Sci. & Technol., Univ. Kebangsaan Malaysia (UKM), Bangi, Malaysia
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 1 2010
  • Firstpage
    1026
  • Lastpage
    1031
  • Abstract
    Having an accurate Signature Detection Classification (SDC) Model has become highly demanding for Intrusion Detection Systems (IDS) to secure networks, especially when dealing with large and complex security audit data set. Selecting appropriate network features is one of the factors that influence the accuracy of SDC model. Past research has shown that the Hidden Marcov Chain, Genetic Algorithm, and the two-second time windows are among the best features selection methods for SDC Model. However this paper aims to improve the accuracy model by applying the features extraction based customized features. The customized features are the network data set which has been preprocessed through the following steps: removing biased attributes, discretized using chi-merge and remove the attributes with string value. The previous research applies the feature extraction based on all features. The best model is measured based on the detection rate, false alarm rate and number of rules using four data mining techniques such as Ripper(Jrip), Ridor, PART and decision three. The experiment is conducted using three random KDD-cup99 data sets. The result shows that the features extraction based on customized features has increased the accuracy model between 0.4% to 9% detection rates and reduced between 0.17% to 0.5% false alarm rates. The result shows the importance of data preprocessing in producing a high quality SDC Model.
  • Keywords
    data mining; decision trees; digital signatures; feature extraction; pattern classification; security of data; KDD-cup99 data sets; PART; Ridor; Ripper(Jrip); biased attribute removal; customized features; data mining; data preprocessing; decision three; detection rate; false alarm rate; features extraction; features selection; intrusion detection system; network security; security audit data set; signature detection classification model; Data Mining; Features Selection; Genetic algorithm feature selection; JRip algorithm; Signature Detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications (ISDA), 2010 10th International Conference on
  • Conference_Location
    Cairo
  • Print_ISBN
    978-1-4244-8134-7
  • Type

    conf

  • DOI
    10.1109/ISDA.2010.5687051
  • Filename
    5687051