Title :
Accurate Adware Detection Using Opcode Sequence Extraction
Author :
Shahzad, Raja Khurram ; Lavesson, Niklas ; Johnson, Henric
Author_Institution :
Sch. of Comput., Blekinge Inst. of Technol., Karlskrona, Sweden
Abstract :
Adware represents a possible threat to the security and privacy of computer users. Traditional signature-based and heuristic-based methods have not been proven to be successful at detecting this type of software. This paper presents an adware detection approach based on the application of data mining on disassembled code. The main contributions of the paper is a large publicly available adware data set, an accurate adware detection algorithm, and an extensive empirical evaluation of several candidate machine learning techniques that can be used in conjunction with the algorithm. We have extracted sequences of opcodes from adware and benign software and we have then applied feature selection, using different configurations, to obtain 63 data sets. Six data mining algorithms have been evaluated on these data sets in order to find an efficient and accurate detector. Our experimental results show that the proposed approach can be used to accurately detect both novel and known adware instances even though the binary difference between adware and legitimate software is usually small.
Keywords :
data mining; data privacy; invasive software; learning (artificial intelligence); adware detection; benign software; binary difference; data mining; disassembled code; feature selection; legitimate software; machine learning technique; opcode sequence extraction; privacy; security; Classification algorithms; Data mining; Feature extraction; Frequency measurement; Malware; Software; Vocabulary; Adware Detection; Binary Classification; Data Mining; Disassembly; Instruction Sequences; Static Analysis;
Conference_Titel :
Availability, Reliability and Security (ARES), 2011 Sixth International Conference on
Conference_Location :
Vienna
Print_ISBN :
978-1-4577-0979-1
Electronic_ISBN :
978-0-7695-4485-4
DOI :
10.1109/ARES.2011.35