• DocumentCode
    598587
  • Title
    Hierarchical task mapping of cell-based AMR cosmology simulations
  • Author
    Jingjin Wu; Zhiling Lan; Xuanxing Xiong; Nickolay Y. Gnedin; A. V. Kravtsov
  • Author_Institution
    Dept. of Comput. Sci., Illinois Inst. of Technol., Chicago, IL, USA
  • fYear
    2012
  • fDate
    10-16 Nov. 2012
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Cosmology simulations are highly communication-intensive, so topology-aware task mapping is critical for performance optimization. To exploit the architectural properties of multiprocessor clusters (the performance gap between inter-node and intra-node communication, as well as the gap between inter-socket and intra-socket communication), we design and develop a hierarchical task mapping scheme for cell-based AMR (Adaptive Mesh Refinement) cosmology simulations, in particular the ART (Adaptive Refinement Tree) application. Our scheme consists of two parts: (1) an inter-node mapping that assigns application processes to nodes with the objective of minimizing network traffic among nodes, and (2) an intra-node mapping within each node that minimizes the maximum size of messages transmitted between CPU sockets. Experiments on production supercomputers with 3D torus and fat-tree topologies show that our scheme can reduce application communication cost by up to 50%. More importantly, our scheme is generic and can be extended to many other applications.
  • Keywords
    astronomy computing; cosmology; cost reduction; minimisation; multiprocessing systems; parallel machines; 3D torus; ART; CPU sockets; adaptive mesh refinement; adaptive refinement tree; application communication cost reduction; cell-based AMR cosmology simulations; fat-tree topologies; hierarchical task mapping scheme; inter-node mapping; intra-node mapping; maximum message size minimization; multiprocessor cluster architectural properties; network traffic minimization; performance optimization; supercomputers; topology-aware task mapping techniques; Adaptation models; Computational modeling; Multicore processing; Runtime; Sockets; Subspace constraints; Topology;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    2167-4329
  • Print_ISBN
    978-1-4673-0805-2
  • Type
    conf
  • DOI
    10.1109/SC.2012.63
  • Filename
    6468474