• DocumentCode
    865900
  • Title

    Architectural support for uniprocessor and multiprocessor active memory systems

  • Author

    Kim, Daehyun ; Chaudhuri, Mainak ; Heinrich, Mark ; Speight, Evan

  • Author_Institution
    Comput. Syst. Lab., Cornell Univ., Ithaca, NY, USA
  • Volume
    53
  • Issue
    3
  • fYear
    2004
  • fDate
    3/1/2004 12:00:00 AM
  • Firstpage
    288
  • Lastpage
    307
  • Abstract
    We introduce an architectural approach to improve memory system performance in both uniprocessor and multiprocessor systems. The architectural innovation is a flexible active memory controller backed by specialized cache coherence protocols that permit the transparent use of address remapping techniques. The resulting system shows significant performance improvement across a spectrum of machine configurations, from uniprocessors through single-node multiprocessors (SMPs) to distributed shared memory clusters (DSMs). Address remapping techniques exploit the data access patterns of applications to enhance their cache performance. However, they create coherence problems since the processor is allowed to refer to the same data via more than one address. While most active memory implementations require cache flushes, we present a new approach to solve the coherence problem. We leverage and extend the cache coherence protocol so that our techniques work transparently to efficiently support uniprocessor, SMP and DSM active memory systems. We detail the coherence protocol extensions to support our active memory techniques and present simulation results that show uniprocessor speedup from 1.3 to 7.6 on a range of applications and microbenchmarks. We also show remarkable performance improvement on small to medium-scale SMP and DSM multiprocessors, allowing some parallel applications to continue to scale long after their performance levels off on normal systems.
  • Keywords
    cache storage; distributed shared memory systems; memory architecture; protocols; DSM; SMP; active memory controller; address remapping technique; architectural support; cache coherence protocol; data access pattern; distributed shared memory cluster; machine configuration spectrum; memory system performance; multiprocessor system; single-node multiprocessor; uniprocessor system; Access protocols; Coherence; Computer architecture; Control systems; Hardware; Memory architecture; Multiprocessing systems; Prefetching; System performance; Technological innovation;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2004.1261836
  • Filename
    1261836