• DocumentCode
    2320279
  • Title

    Towards Trusted Services: Result Verification Schemes for MapReduce

  • Author

    Huang, Chu ; Zhu, Sencun ; Wu, Dinghao

  • Author_Institution
    Sch. of Inf. Sci. & Technol., Pennsylvania State Univ., University Park, PA, USA
  • fYear
    2012
  • fDate
    13-16 May 2012
  • Firstpage
    41
  • Lastpage
    48
  • Abstract
    Recent development in Internet-scale data applications and services, combined with the proliferation of cloud computing, has created a new computing model for data intensive computing best characterized by the MapReduce paradigm. The MapReduce computing paradigm, pioneered by Google in its Internet search application, is an architectural and programming model for efficiently processing massive amount of raw unstructured data. With the availability of the open source Hadoop tools, applications built based on the MapReduce computing model are rapidly growing. In this work, we focus on a unique security concern on the MapReduce architecture. Given the potential security risks from lazy or malicious servers involved in a MapReduce task, we design efficient and innovative mechanisms for detecting cheating services under the MapReduce environment based on watermark injection and random sampling methods. The new detection schemes are expected to significantly reduce the cost of verification overhead. Finally, extensive analytical and experimental evaluation confirms the effectiveness of our schemes in MapReduce result verification.
  • Keywords
    cloud computing; program verification; public domain software; random processes; sampling methods; search engines; software architecture; trusted computing; watermarking; Google; Internet search application; Internet-scale data applications; Internet-scale data services; MapReduce architecture; MapReduce computing paradigm; architectural model; cheating services detection; cloud computing; data intensive computing; lazy servers; malicious servers; open source Hadoop tools; programming model; random sampling methods; security risks; trusted services; verification overhead cost reduction; watermark injection; Computational modeling; Equations; Indexes; Mathematical model; Watermarking; Web pages; MapReduce; random sampling; result verification; trustworthy; watermark injection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on
  • Conference_Location
    Ottawa, ON
  • Print_ISBN
    978-1-4673-1395-7
  • Type

    conf

  • DOI
    10.1109/CCGrid.2012.77
  • Filename
    6217403