• DocumentCode
    3189652
  • Title

    Polynomial-time nested loop fusion with full parallelism

  • Author

    Sha, Edwin H M ; Lang, Chenhua ; Passos, Nelson L.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Notre Dame Univ., IN, USA
  • Volume
    3
  • fYear
    1996
  • fDate
    12-16 Aug 1996
  • Firstpage
    9
  • Abstract
    Data locality and synchronization overhead are two important factors that affect the performance of applications on multiprocessors. Loop fusion is an effective way of reducing synchronization and improving data locality. Traditional fusion techniques, however either cannot address the case when fusion-preventing dependence exists in nested loops, or cannot achieve good parallelism after fusion. This paper gives a significant improvement by presenting several efficient polynomial-time algorithms to solve these problems. These algorithms combined with the retiming technique allow nested loop fusion in the existence of outmost loop-carried dependence as in the presence of fusion-preventing dependence. Furthermore, the technique is proved to achieve fully parallel execution of the fused loops
  • Keywords
    computational complexity; multiprocessing systems; parallel algorithms; parallel programming; program control structures; programming theory; software performance evaluation; synchronisation; application performance; data locality; fusion-preventing dependence; multiprocessors; outmost loop-carried dependence; parallel program; polynomial-time algorithms; polynomial-time nested loop fusion; retiming technique; synchronization overhead; Application software; Computer science; Concurrent computing; Fuses; High performance computing; Law; Legal factors; Parallel processing; Polynomials; Weather forecasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Processing, 1996. Vol.3. Software., Proceedings of the 1996 International Conference on
  • Conference_Location
    Ithaca, NY
  • ISSN
    0190-3918
  • Print_ISBN
    0-8186-7623-X
  • Type

    conf

  • DOI
    10.1109/ICPP.1996.538554
  • Filename
    538554