• DocumentCode
    2547270
  • Title

    Analysis and Performance Results of a Molecular Modeling Application on Merrimac

  • Author

    Erez, Mattam ; Ahn, Jung Ho ; Garg, Ankit ; Dally, William J. ; Darve, Eric

  • Author_Institution
    Stanford University
  • fYear
    2004
  • fDate
    06-12 Nov. 2004
  • Firstpage
    42
  • Lastpage
    42
  • Abstract
    The Merrimac supercomputer uses stream processors and a high-radix network to achieve high performance at low cost and low power. The stream architecture matches the capabilities of modem semiconductor technology with compute-intensive parallel applications. We present a detailed case study of porting the GROMACS molecular-dynamics force calculation to Merrimac. The characteristics of the architecture which stress locality, parallelism, and decoupling of memory operations and computation, allow for high performance of compiler optimized code. The rich set of hardware memory operations and the ample computation bandwidth of the Merrimac processor present a wide range of algorithmic trade-offs and optimizations which may be generalized to several scientific computing domains. We use a cycle-accurate hardware simulator to analyze the performance bottlenecks of the various implementations and to measure application run-time. A comparison with the highly optimized GROMACS code, tuned for an Intel Pentium 4, confirms Merrimac’s potential to deliver high performance.
  • Keywords
    Computer applications; Computer architecture; Concurrent computing; Costs; Hardware; Modems; Parallel processing; Performance analysis; Stress; Supercomputers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, 2004. Proceedings of the ACM/IEEE SC2004 Conference
  • Print_ISBN
    0-7695-2153-3
  • Type

    conf

  • DOI
    10.1109/SC.2004.69
  • Filename
    1392972