• DocumentCode
    2050490
  • Title

    Remove the memory wall: from performance modeling to architecture optimization

  • Author

    Sun, Xian-He

  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL
  • fYear
    2006
  • fDate
    25-29 April 2006
  • Abstract
    Summary form only given. Data access is a known bottleneck of high performance computing (HPC). The prime sources of this bottleneck are the performance gap between the processor and memory storage and the large memory requirements of ever-hungry applications. Although advanced memory hierarchies and parallel file systems have been developed in recent years, they only provide high bandwidth for contiguous, well-formed data streams, performing poorly for accessing small, noncontiguous data. Unfortunately, many HPC applications make a large number of requests for small and noncontiguous pieces of data, as do high-level I/O libraries such as HDF-5. The problematic memory wall remains after years of study and, in fact, is becoming the most important issue of HPC. We propose a new I/O architecture for HPC. Unlike traditional I/O designs where data is stored and retrieved by request, our architecture is based on a novel "server-push" model in which a data access server proactively pushes data from a file server to the compute node\´s memory or to it\´s cache directly based on the architecture design. Simulation results show that with the new approach the cache hit rates increase well above 90% for various benchmark applications that are notorious for poor cache performance. Performance evaluation is the driven force of the push-based model. Mechanisms of performance modeling, evaluation, and optimization are applied to data access pattern identification, prefetching algorithm design, data replacement strategy development, and architecture optimization to enable the "server-push" model. Our current success illustrates the power and unique role of performance evaluation in computing
  • Keywords
    file servers; input-output programs; memory architecture; parallel processing; performance evaluation; HDF-5; I/O architecture; architecture optimization; data access pattern identification; data replacement strategy development; file server; high performance computing; high-level I/O library; memory storage; memory wall; parallel file system; performance evaluation; performance modeling; prefetching algorithm design; server-push model; Bandwidth; Computational modeling; Computer architecture; Design optimization; File servers; File systems; High performance computing; Information retrieval; Libraries; Prefetching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2006. IPDPS 2006. 20th International
  • Conference_Location
    Rhodes Island
  • Print_ISBN
    1-4244-0054-6
  • Type

    conf

  • DOI
    10.1109/IPDPS.2006.1639621
  • Filename
    1639621