• DocumentCode
    2573037
  • Title

    A hierarchical locality algorithm for NUMA compilation

  • Author

    O´Boyle, M.

  • Author_Institution
    Dept. of Comput. Sci., Manchester Univ.
  • fYear
    1995
  • fDate
    25-27 Jan 1995
  • Firstpage
    106
  • Lastpage
    113
  • Abstract
    A compiler algorithm which exploits program locality and reduces the latency overhead in parallel hierarchical memory machines is described. By applying the appropriate transformation at different levels of the hierarchy, the amount of nonlocal accesses between processors is minimised. Similarly, the memory structure within a processor is exploited so reducing the amount of communication between local main memory and private cache. This algorithm is based on a compound sequence of transformations that goes beyond unimodular transformations described in previous Work. This algorithm can exploit locality in complex array accesses and general iteration spaces. Furthermore, by use of strip mining and a novel use of data alignment, excessive storage for temporaries can be prevented
  • Keywords
    distributed memory systems; parallel machines; program compilers; storage management; NUMA compilation; compiler algorithm; complex array accesses; compound sequenc; data alignment; general iteration spaces; hierarchical locality algorithm; latency overhead; memory structure; nonlocal accesses; parallel hierarchical memory machines; program locality; strip mining; unimodular transformations; Computer science; Concurrent computing; Context; Cost function; Delay; Distributed computing; Magnetic heads; Memory architecture; Parallel processing; Program processors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing, 1995. Proceedings. Euromicro Workshop on
  • Conference_Location
    San Remo
  • Print_ISBN
    0-8186-7031-2
  • Type

    conf

  • DOI
    10.1109/EMPDP.1995.389149
  • Filename
    389149