• DocumentCode
    56339
  • Title

    Parallel Workload Modeling with Realistic Characteristics

  • Author

    Tran Ngoc Minh ; Thoai Nam ; Epema, Dick H. J.

  • Author_Institution
    Leiden Inst. of Adv. Comput. Sci., Leiden Univ., Leiden, Netherlands
  • Volume
    25
  • Issue
    8
  • fYear
    2014
  • fDate
    Aug. 2014
  • Firstpage
    2138
  • Lastpage
    2148
  • Abstract
    Workload modeling and performance evaluation play crucial roles in the study of scheduling algorithms on large-scale parallel and distributed systems. An effective design of a scheduling algorithm for these systems requires experiments with hundreds of simulations to evaluate its performance. Since each simulation needs one workload as input, only real workloads with usually a limited availability are not sufficient, and so representative workload models are needed. Several studies have shown that realistic workload characteristics such as burstiness, bag-of-tasks, etc., cause significant performance impacts on scheduling. Therefore, we argue that realistic workload models should contain as many characteristics of real workloads as possible. In practice, researchers use unrealistic workloads in their scheduling evaluations because they lack models that can help generate realistic workloads. In this article, we analyze real parallel workloads to show the presence of important characteristics including long range dependence, periodicity and temporal burstiness of job arrivals, bag-of-tasks behavior, and correlation of runtime and number of processors. Then, we present a systematic approach to create a complete model that contains all of these characteristics. Validation of our model with real world data shows that it does not only capture the above characteristics, but also can fit marginal distributions well.
  • Keywords
    parallel processing; scheduling; software performance evaluation; bag-of-tasks; distributed systems; large-scale parallel systems; long range dependence; marginal distributions; parallel workload modeling; performance evaluation; realistic characteristics; representative workload models; scheduling algorithms; scheduling evaluations; temporal job arrival burstiness; Correlation; Data models; Load modeling; Local area networks; Materials; Parallel processing; Runtime; Parallel workload modeling; bag-of-tasks; correlation; long range dependence; periodicity; temporal burstiness;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2013.182
  • Filename
    6567858