• DocumentCode
    688148
  • Title

    Performance Model for Master/Worker Hybrid Applications on Multicore Clusters

  • Author

    Castellanos, Abel ; Moreno, Alexander ; Sorribes, Joan ; Margalef, Tomas

  • Author_Institution
    Dept. Arquitectura de Computadors i Sist. Operatius, Univ. Autonoma de Barcelona, Barcelona, Spain
  • fYear
    2013
  • fDate
    13-15 Nov. 2013
  • Firstpage
    210
  • Lastpage
    217
  • Abstract
    There are several parallel applications that are implemented using a Master/Worker parallel/distributed programming paradigm. Applications using this predefined programming structure can be easily implemented using message passing programming libraries (MPI). Moreover, the multicore features present nowadays on CPU architecture can be exploited at the node level by applying thread parallelism (OpenMP). In order to exploit the benefits of both two levels of parallelism, the Master/Worker applications can be implemented as hybrid applications. However, reaching the expected performance indexes is not trivial because there are several parameters (number of nodes, number of threads per node, thread affinity and data distribution among all nodes) that must be tuned for each particular application or even during its execution to reach a successful performance. On the other hand, the application workload may change drastically during successive executions, so those parameters need to be modified according to this behavior. Additionally, cache memory architecture in multicore systems directly influences the performance and this behaviour must be deeply analysed. In this paper we present a proposal to model the performance of hybrid Master/Worker applications on multicore systems considering the issues outlined above. In particular, this model determines dynamically at runtime the configuration of the appropriate number of workers and threads of the hybrid application to achieve the best possible performance.
  • Keywords
    application program interfaces; multiprocessing systems; parallel processing; performance evaluation; CPU architecture; MPI; OpenMP; cache memory architecture; data distribution; master-worker hybrid applications; master-worker parallel-distributed programming paradigm; message passing programming libraries; multicore clusters; multicore systems; parallel applications; performance indexes; performance model; thread parallelism; Computational modeling; Data models; Mathematical model; Message systems; Multicore processing; Predictive models; Hybrid applications; MPI; Master/Worker; OpenMP; Performance model; multicore;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing (HPCC_EUC), 2013 IEEE 10th International Conference on
  • Conference_Location
    Zhangjiajie
  • Type

    conf

  • DOI
    10.1109/HPCC.and.EUC.2013.39
  • Filename
    6831921