• DocumentCode
    3351084
  • Title
    Global partial replicate computation partitioning
  • Author
    Wang, Yiran ; Chen, Li ; Zhang, Zhao-Qing
  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • fYear
    2004
  • fDate
    15-18 Aug. 2004
  • Firstpage
    108
  • Abstract
    Early parallelizing compilers use the owner-computes rule to partition computation. Partial replication was later introduced to eliminate near-neighbor communication at the cost of some replicated computation, thereby improving performance and scalability. Current exploration of partial replicate computation partitioning is limited to a single loop nest. We present a formal description of the global partial replicate computation partitioning problem, a simplified cost model, and a heuristic solution. Experimental results show that the solution is superior to local approaches.
  • Keywords
    distributed memory systems; formal specification; parallel programming; parallelising compilers; program control structures; heuristic solution; loop nest; parallelizing compiler; partial replication; partition computation; replicate computation partitioning; Computers; Concurrent computing; Content addressable storage; Cost function; Degradation; Distributed computing; Frequency; Home computing; Prototypes; Scalability
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Parallel Processing, 2004. ICPP 2004. International Conference on
  • ISSN
    0190-3918
  • Print_ISBN
    0-7695-2197-5
  • Type
    conf
  • DOI
    10.1109/ICPP.2004.1327910
  • Filename
    1327910