DocumentCode :
3288307
Title :
Evaluation of Random Projection for Malware Classification
Author :
Ponomarev, Stanislav ; Durand, Jan ; Wallace, Nathan ; Atkison, Travis
Author_Institution :
Louisiana Tech Univ., Ruston, LA, USA
fYear :
2013
fDate :
18-20 June 2013
Firstpage :
68
Lastpage :
73
Abstract :
Research efforts to develop malicious application detection algorithms have been a priority ever since the discovery of the first "viruses". Various methods are used to search and identify these malicious applications. One such method, n-gram analysis, can be implemented to extract features from binary files. These features are then be used by machine learning algorithms to classify them as malicious or benign. However, the resulting high dimensionality of the features makes accurate detection in some cases impossible. This is known as "the curse of dimensionality". To counteract this effect, a feature reduction technique known as randomized projection was implemented. Through this reduction, not only are classification times decreased but also an increase in true positive and decreases false positive rates are observed. By varying the n-gram size and target feature size it is possible to fine-tune the accuracy of machine learning algorithms to reach an average accuracy of 99%.
Keywords :
data mining; feature extraction; invasive software; learning (artificial intelligence); pattern classification; binary files; data mining; feature extraction; feature reduction technique; machine learning algorithms; malicious application detection algorithms; malware classification; n-gram analysis; random projection evaluation; the curse of dimensionality; Accuracy; Data mining; Feature extraction; Malware; Mutual information; Support vector machines; Vectors; Computer security; Data mining; Feature extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Security and Reliability-Companion (SERE-C), 2013 IEEE 7th International Conference on
Conference_Location :
Gaithersburg, MD
Print_ISBN :
978-1-4799-2924-5
Type :
conf
DOI :
10.1109/SERE-C.2013.29
Filename :
6616327
Link To Document :
بازگشت