Title :
Unity in Diversity: Phylogenetic-inspired Techniques for Reverse Engineering and Detection of Malware Families
Author :
Khoo, Wei Ming ; Lió, Pietro
Author_Institution :
Univ. of Cambridge, Cambridge, UK
Abstract :
We developed a framework for abstracting, aligning and analysing malware execution traces and performed a preliminary exploration of state of the art phylogenetic methods, whose strengths lie in pattern recognition and visualisation, to derive the statistical relationships within two contemporary malware families. We made use of phylogenetic trees and networks, motifs, logos, composition biases, and tree topology comparison methods with the objective of identifying common functionality and studying sources of variation in related samples. Networks were more useful for visualising short nop-equivalent code metamorphism than trees, tree topology comparison was suited for studying variations in multiple sets of homologous procedures. We found logos could be used for code normalisation, which resulted in 33% to 62% reduction in the number of instructions. A motif search showed that API sequences related to the management of memory, I/O, libraries and threading do not change significantly amongst malware variants, composition bias provided an efficient way to distinguish between families. Using context-sensitive procedure analysis, we found that 100% of a set of memory management procedures used by the FakeAV-DO and "Skyhoo" malware families were uniquely identifiable. We discuss how phylogenetic techniques can aid the reverse engineering and detection of malware families and describe some related challenges.
Keywords :
application program interfaces; invasive software; reverse engineering; trees (mathematics); API sequences; FakeAV-DO; context-sensitive procedure analysis; malware family detection; memory management; pattern recognition; phylogenetic-inspired techniques; reverse engineering; tree topology; Kernel; Malware; Matrices; Phylogeny; Proteins; Topology; Vegetation; malware analysis; phylogenetics;
Conference_Titel :
SysSec Workshop (SysSec), 2011 First
Conference_Location :
Amsterdam
Print_ISBN :
978-1-4577-1528-0
DOI :
10.1109/SysSec.2011.24