Title :
Exploiting 162-Nanosecond End-to-End Communication Latency on Anton
Author :
Dror, Ron O. ; Grossman, J.P. ; Mackenzie, Kenneth M. ; Towles, Brian ; Chow, Edmond ; Salmon, John K. ; Young, Cliff ; Bank, Joseph A. ; Batson, Brannon ; Deneroff, Martin M. ; Kuskin, Jeffrey S. ; Larson, Richard H. ; Moraes, Mark A. ; Shaw, David E.
Author_Institution :
Center for Comput. Biol. & Bioinf., Columbia Univ., New York, NY, USA
Abstract :
Strong scaling of scientific applications on parallel architectures is increasingly limited by communication latency. This paper describes the techniques used to mitigate latency in Anton, a massively parallel special-purpose machine that accelerates molecular dynamics (MD) simulations by orders of magnitude compared with the previous state of the art. Achieving this speedup required a combination of hardware mechanisms and software constructs to reduce network latency, sender and receiver overhead, and synchronization costs. Key elements of Anton´s approach, in addition to tightly integrated communication hardware, include formulating data transfer in terms of counted remote writes, leveraging fine-grained communication, and establishing fixed, optimized communication patterns. Anton delivers software-to-software inter-node latency significantly lower than any other large-scale parallel machine, and the total critical-path communication time for an Anton MD simulation is less than 4% that of the next fastest MD platform.
Keywords :
biology computing; electronic data interchange; molecular biophysics; molecular dynamics method; parallel architectures; parallel machines; scientific information systems; Anton MD simulation; Anton machine; communication hardware; counted remote write; critical-path communication time; data transfer; end-to-end communication latency; fine-grained communication; hardware mechanism; large-scale parallel machine; molecular dynamics simulation; network latency; optimized communication pattern; parallel architecture; parallel special-purpose machine; receiver overhead; scientific application; sender overhead; software-to-software internode latency; synchronization cost; Bandwidth; Computational modeling; Force; Hardware; Radiation detectors; Software; Synchronization;
Conference_Titel :
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7557-5
Electronic_ISBN :
978-1-4244-7558-2