  • DocumentCode
    2248128
  • Title
    Speeding up the memory hierarchy in Flat COMA multiprocessors
  • Author
    Yang, Liuxi ; Torrellas, Josep
  • Author_Institution
    Center for Supercomput. Res. & Dev., Illinois Univ., Urbana, IL, USA
  • fYear
    1997
  • fDate
    1-5 Feb 1997
  • Firstpage
    4
  • Lastpage
    13
  • Abstract
    Scalable Flat Cache Only Memory Architectures (Flat COMA) are designed for reduced memory access latencies while minimizing programmer and operating system involvement. Indeed, to keep memory access latencies low, the programmer need not perform clever data placement, nor does the operating system need to perform page migration. The hardware automatically replicates the data and migrates it to the attraction memories of the nodes that use it. Unfortunately, part of the latency of memory accesses is superfluous. In particular, reads often perform unnecessary attraction memory accesses, require too many network hops, or perform necessary attraction memory accesses inefficiently. In this paper, we propose relatively inexpensive schemes that address these three problems. To eliminate unnecessary attraction memory accesses, we propose a small direct-mapped cache called the Invalidation Cache (IVC). To reduce the number of network hops, the IVC is augmented with hint pointers to processors. These hint pointers are faster and more widely applicable than those in older hint schemes. Finally, to speed up necessary accesses to set-associative attraction memories, we optimize the locality of windows in page-mode DRAMs. We evaluate these optimizations with 32-processor simulations of 8 Splash and Perfect Suite applications. We show that these optimizations speed up the applications by an average of 20% at a modest cost.
  • Keywords
    cache storage; memory architecture; multiprocessing systems; optimisation; Flat COMA multiprocessors; direct-mapped cache; invalidation cache; memory access latencies; memory hierarchy; network hops; operating system; page migration; page-mode DRAMs; scalable Flat cache only memory architectures; Contracts; Cost function; Delay; Hardware; Memory architecture; Operating systems; Programming profession; Random access memory; Research and development; World Wide Web
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High-Performance Computer Architecture, 1997., Third International Symposium on
  • Conference_Location
    San Antonio, TX
  • Print_ISBN
    0-8186-7764-3
  • Type
    conf
  • DOI
    10.1109/HPCA.1997.569581
  • Filename
    569581