• DocumentCode
    2397005
  • Title

    Incorporating memory layout in the modeling of message passing programs

  • Author

    Seinstra, F.J. ; Koelma, D.

  • Author_Institution
    Fac. of Sci., Amsterdam Univ., Netherlands
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    293
  • Lastpage
    300
  • Abstract
    One of the most fundamental tasks of an automatic parallelization tool is to find an optimal domain decomposition for a given application. For regular domain problems (such as simple matrix manipulations) this task may seem trivial. However, communication costs in message passing programs often significantly depend on the memory layout of data blocks to be transmitted. As a consequence, straightforward domain decompositions may be non-optimal. In this paper we introduce a new point-to-point communication model (called P-3PC) that is specifically designed to overcome this problem. In comparison with related models (e.g., LogGP) P-3PC is similar in complexity, but more accurate in many situations. Although the model is aimed at MPI´s standard point-to-point operations, it is applicable to similar message passing definitions as well. The effectiveness of the model is tested in a framework for automatic parallelization of imaging applications. Experiments are performed on two Beowulf-type systems, each having a different interconnection network, and a different MPI implementation. Results show that, where other models frequently fail, P-3PC correctly predicts the communication costs related to any type of domain decomposition
  • Keywords
    application program interfaces; message passing; parallelising compilers; Beowulf-type systems; P-3PQ; automatic parallelization tool; communication costs; data blocks; domain decompositions; matrix manipulations; memory layout; message passing programs; optimal domain decomposition; point-to-point communication model; Automatic testing; Costs; Image processing; Information systems; Intelligent sensors; Intelligent systems; Matrix decomposition; Message passing; Multiprocessor interconnection networks; Predictive models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel, Distributed and Network-based Processing, 2002. Proceedings. 10th Euromicro Workshop on
  • Conference_Location
    Canary Islands
  • Print_ISBN
    0-7695-1444-8
  • Type

    conf

  • DOI
    10.1109/EMPDP.2002.994294
  • Filename
    994294