TY - JOUR
T1 - Toward message passing for a million processes
T2 - Characterizing MPI on a massive scale blue gene/P
AU - Balaji, Pavan
AU - Chan, Anthony
AU - Thakur, Rajeev
AU - Gropp, William
AU - Lusk, Ewing
PY - 2009/9
Y1 - 2009/9
N2 - Upcoming exascale capable systems are expected to comprise more than a million processing elements. As researchers continue to work toward architecting these systems, it is becoming increasingly clear that these systems will utilize a significant amount of shared hardware between processing units; this includes shared caches, memory and network components. Thus, understanding how effective current message passing and communication infrastructure is in tying these processing elements together, is critical to making educated guesses on what we can expect from such future machines. Thus, in this paper, we characterize the communication performance of the message passing interface (MPI) implementation on 32 racks (131072 cores) of the largest Blue Gene/P (BG/P) system in the United States (80% of the total system size) and reveal various interesting insights into it.
AB - Upcoming exascale capable systems are expected to comprise more than a million processing elements. As researchers continue to work toward architecting these systems, it is becoming increasingly clear that these systems will utilize a significant amount of shared hardware between processing units; this includes shared caches, memory and network components. Thus, understanding how effective current message passing and communication infrastructure is in tying these processing elements together, is critical to making educated guesses on what we can expect from such future machines. Thus, in this paper, we characterize the communication performance of the message passing interface (MPI) implementation on 32 racks (131072 cores) of the largest Blue Gene/P (BG/P) system in the United States (80% of the total system size) and reveal various interesting insights into it.
UR - http://www.scopus.com/inward/record.url?scp=69549107488&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=69549107488&partnerID=8YFLogxK
U2 - 10.1007/s00450-009-0095-3
DO - 10.1007/s00450-009-0095-3
M3 - Article
AN - SCOPUS:69549107488
SN - 1865-2034
VL - 24
SP - 11
EP - 19
JO - Computer Science - Research and Development
JF - Computer Science - Research and Development
IS - 1-2
ER -