• DocumentCode
    707410
  • Title

    Opcode position aware metamorphic malware detection: Signature vs histogram approach

  • Author

    George, Nithil ; Vinod, P.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., SCMS Sch. of Eng. & Technol., Ernakulam, India
  • fYear
    2015
  • fDate
    11-13 March 2015
  • Firstpage
    1011
  • Lastpage
    1017
  • Abstract
    Unlike the conventional approaches in the detection of metamorphic malware, a novel statistical non signature based detection technique is proposed. The proposed methodology aims to determine if alignment of locations or histogram of a specific opcode bigram is superior in the classification of metamorphic malware samples. In this work, we used Term Frequency-Inverse Document Frequency-Class Frequency (TF-IDF-CF) as feature selection method for synthesizing prominent features. Vector space models has been constructed by preserving hamming distance and Smith Waterman local sequence alignment score. Experiment results depicted that with Smith Waterman sequence alignment, best results were obtained with 300 significant malware features (94.01% accuracy, 92.24% F-measure, 100% precision and 49.89% recall). However, hamming distance based reference model, with 7 bigrams resulted in 100% precision, 99.76% accuracy, 99.71% F-measure and 99.42% recall.
  • Keywords
    feature selection; invasive software; pattern classification; sequences; F-measure value; Smith Waterman local sequence alignment score; TF-IDF-CF; accuracy value; feature selection method; feature synthesis; hamming distance-based reference model; histogram alignment; histogram approach; location alignment; malware features; metamorphic malware classification; opcode bigram; opcode position aware metamorphic malware detection; precision value; recall value; signature approach; statistical nonsignature-based detection technique; term frequency-inverse document frequency-class frequency; vector space models; Analytical models; Detectors; Feature extraction; Malware; Predictive models; Random access memory; Semantics; bigrams; hamming distance; malware; metamorphism; sequence alignment; smith waterman;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing for Sustainable Global Development (INDIACom), 2015 2nd International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-9-3805-4415-1
  • Type

    conf

  • Filename
    7100400