• DocumentCode
    160603
  • Title

    A Scalable Approach to Source Camera Identification over Hadoop

  • Author

    Cattaneo, Giuseppe ; Roscigno, Gianluca ; Petrillo, Umberto Ferraro

  • Author_Institution
    Dipt. di Inf., Univ. degli Studi di Salerno, Fisciano, Italy
  • fYear
    2014
  • fDate
    13-16 May 2014
  • Firstpage
    366
  • Lastpage
    373
  • Abstract
    In this paper, we explore the possibility to solve a commonly-known digital image forensics problem, the Source Camera Identification (SCI) problem, using a distributed approach. The SCI problem requires to recognize the camera used to acquire a given digital image, distinguishing even among cameras of the same brand and model. The solution we present is based on the algorithm by Lukas Fridrich, as it is recognized by many as the reference solution for this problem, and is formulated according to the MapReduce paradigm, as implemented by the Hadoop framework. The first implementation we coded was straightforward to obtain as we leveraged the ability of the Hadoop framework to turn a stand-alone Java application into a distributed one with very few interventions on its original source code. However, our first experimental results with this code were not encouraging. Thus, we conducted a careful profiling activity that allowed us to pinpoint some serious performance issues arising with this vanilla porting of the algorithm. We then developed several optimizations to improve the performance of the Lukas algorithm by taking better advantage of the Hadoop framework. The out coming implementations have been subject to a thorough experimental analysis, conducted using a cluster of 33 commodity PCs and a data set of 5, 160 images. The experimental results show that the performance of our optimized implementations scale well with the number of computing nodes while exhibiting performance that are, at most, two times slower than the maximum speedup theoretically achievable.
  • Keywords
    Java; cameras; image forensics; Hadoop; Java application; Lukas algorithm; MapReduce paradigm; SCI problem; camera recognition; digital image forensics problem; profiling activity; scalable approach; source camera identification; Cameras; Correlation; Digital images; Image color analysis; Noise; Spatial resolution; Training; Digital Image Forensics; Distributed Computing; Hadoop; Source Camera Identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications (AINA), 2014 IEEE 28th International Conference on
  • Conference_Location
    Victoria, BC
  • ISSN
    1550-445X
  • Print_ISBN
    978-1-4799-3629-8
  • Type

    conf

  • DOI
    10.1109/AINA.2014.47
  • Filename
    6838688