• DocumentCode
    922323
  • Title

    An iteration partition approach for cache or local memory thrashing on parallel processing

  • Author

    Fang, Jesse Zhixi ; Lu, Mi

  • Author_Institution
    Hewlett-Packard Lab., Palo Alto, CA, USA
  • Volume
    42
  • Issue
    5
  • fYear
    1993
  • fDate
    5/1/1993 12:00:00 AM
  • Firstpage
    529
  • Lastpage
    546
  • Abstract
    Parallel processing systems with cache or local memory in the memory hierarchies are considered. These systems have a local cache memory in each processor and usually employ a write-invalidate protocol for the cache coherence. In such systems, a problem called `cache or local memory thrashing´ can arise in executions of parallel programs, when the data unnecessarily moves back and forth between the caches or local memories in different processors. An approach to eliminate, or at least to reduce, such movement for nested parallel loops is presented. It is based on relations between array element accesses and enclosed loop indexes in the loops. The relations can be used to assign processors to execute the appropriate iterations for parallel loops in the loop nests with respect to the data in their caches or local memories. An algorithm for calculating the correct iteration of the parallel loop in terms of loop indexes of the previous iterations executed in the processor is presented. This method benefits parallel code with nested loop structures in a wide range of applications. The experimental results show that the technique can achieve speedups up to 2
  • Keywords
    iterative methods; memory architecture; parallel programming; storage management; array element accesses; cache coherence; correct iteration; enclosed loop indexes; iteration partition approach; local cache memory; local memories; local memory thrashing; loop nests; memory hierarchies; nested loop structures; nested parallel loops; parallel code; parallel loops; parallel programs; write-invalidate protocol; Access protocols; Bridges; Cache memory; Degradation; Dynamic scheduling; Hardware; Laboratories; Parallel processing; System performance; Yarn;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/12.223672
  • Filename
    223672