• DocumentCode
    3691698
  • Title

    Matching machine-code functions in executables within one product line via bioinformatic sequence alignment

  • Author

    Arne Wichmann;Sibylle Schupp

  • Author_Institution
    Institute for Software Systems Hamburg University of Technology, Germany
  • fYear
    2015
  • fDate
    9/1/2015 12:00:00 AM
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper we evaluate whether different executables from the same software product line have similar sequences of machine-code functions. We provide a method of creating matchings of machine-code functions using alignment techniques known from bioinformatics. We map, per function, vectors of code metrics to symbols from an alphabet using machine learning techniques, and construct sequence alignments using off-the-shelf alignment tools. Our evaluation of alignments of glibc versions, musl optimizations, different RedBoot platforms and architectures, and the Linux kernel shows that the above statement holds in all cases except for differing architectures. Our method can therefore be used to match functions in executables for most variations within one product line.
  • Keywords
    "Principal component analysis","Computer architecture","Optimization","Linux","Libraries","Proteins","Bioinformatics"
  • Publisher
    ieee
  • Conference_Titel
    Mining Unstructured Data (MUD), 2015 IEEE 5th Workshop on
  • Type

    conf

  • DOI
    10.1109/MUD.2015.7327962
  • Filename
    7327962