Title :
Exploring the stability of feature selection for imbalanced intrusion detection data
Author :
Fang Li ; Hong Mi ; Fan Yang
Author_Institution :
Dept. of Autom., Xiamen Univ., Xiamen, China
Abstract :
The class imbalance problem is of great importance to network intrusion detection data. Previous studies on feature selection always evaluate the performance of feature selection process according to the model performance and the size of selected feature subset, which neglect the stability of feature selection. We investigate the problem of the stability of feature selection and study in detail the properties of two state-of-the-art feature selection method, i.e. support vector machine recursive feature elimination (SVM-RFE) and random forest variable importance measures (RF-VIM) on the imbalanced intrusion detection data. Experimental results on KDD Cup 99 network intrusion data show the influence of imbalance rate on the stability of the algorithms, and demonstrate that stability is an important evaluation indicator of algorithm in practical applications of intrusion detection.
Keywords :
security of data; support vector machines; KDD Cup 99 network intrusion data; class imbalance problem; feature selection; imbalanced intrusion detection data; network intrusion detection data; random forest variable importance measures; support vector machine recursive feature elimination; Accuracy; Educational institutions; Feature extraction; Intrusion detection; Stability criteria; Support vector machines; feature selection; imbalanced data; network intrusion detection; stability;
Conference_Titel :
Control and Automation (ICCA), 2011 9th IEEE International Conference on
Conference_Location :
Santiago
Print_ISBN :
978-1-4577-1475-7
DOI :
10.1109/ICCA.2011.6138076