• DocumentCode
    172829
  • Title

    Taming Computation Skews of Block-Oriented Iterative Scientific Applications in MapReduce Systems

  • Author

    Xin Yang ; Min Li ; Ze Yu ; Xiaolin Li

  • Author_Institution
    Scalable Software Syst. Lab., Univ. of Florida, Gainesville, FL, USA
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    176
  • Lastpage
    183
  • Abstract
    Nowadays, scientists are embracing big data techniques for exploring significant discoveries from large volumes of scientific data quickly. Properly partitioning workloads is essential for fully exploiting the benefit of parallelism, but is difficult for applications whose computations change iteratively. Computation skews are inevitable when executing block-oriented iterative scientific applications in MapReduce systems. This paper proposes iPart, an autonomic workload partitioning system for taming computation skews of block-oriented iterative scientific applications in MapReduce systems. iPart introduces a workload control loop into the conventional execution of MapReduce jobs. Workload estimates in terms of execution time are collected in the reduce phase and fed back to the partition phase to update partitioning plans. Computation skews are detected and addressed by adapting partitioning to computation changes iteratively. Two adaptive partitioning methods based on the binary partitioning method are presented. Experimental evaluations with two simulated applications and the synthetic and real-world data prove that iPart responds to computation changes and adapts partitioning quickly and accurately.
  • Keywords
    Big Data; distributed processing; natural sciences computing; MapReduce systems; adaptive partitioning methods; autonomic workload partitioning; big data techniques; binary partitioning method; block-oriented iterative scientific applications; computation skews taming; iPart; partitioning plan updating; workload control loop; Binary trees; Computational modeling; Merging; Parallel processing; Synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cloud Computing (CLOUD), 2014 IEEE 7th International Conference on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5062-1
  • Type

    conf

  • DOI
    10.1109/CLOUD.2014.33
  • Filename
    6973739