TY - GEN
T1 - Open issues in MPI implementation
AU - Thakur, Rajeev
AU - Gropp, William
PY - 2007
Y1 - 2007
N2 - MPI (the Message Passing Interface) continues to be the dominant programming model for parallel machines of all sizes, from small Linux clusters to the largest parallel supercomputers such as IBM Blue Gene/L and Cray XT3. Although the MPI standard was released more than 10 years ago and a number of implementations of MPI are available from both vendors and research groups, MPI implementations still need improvement in many areas. In this paper, we discuss several such areas, including performance, scalability, fault tolerance, support for debugging and verification, topology awareness, collective communication, derived datatypes, and parallel I/O. We also present results from experiments with several MPI implementations (MPICH2, Open MPI, Sun, IBM) on a number of platforms (Linux clusters, Sun and IBM SMPs) that demonstrate the need for performance improvement in one-sided communication and support for multithreaded programs.
AB - MPI (the Message Passing Interface) continues to be the dominant programming model for parallel machines of all sizes, from small Linux clusters to the largest parallel supercomputers such as IBM Blue Gene/L and Cray XT3. Although the MPI standard was released more than 10 years ago and a number of implementations of MPI are available from both vendors and research groups, MPI implementations still need improvement in many areas. In this paper, we discuss several such areas, including performance, scalability, fault tolerance, support for debugging and verification, topology awareness, collective communication, derived datatypes, and parallel I/O. We also present results from experiments with several MPI implementations (MPICH2, Open MPI, Sun, IBM) on a number of platforms (Linux clusters, Sun and IBM SMPs) that demonstrate the need for performance improvement in one-sided communication and support for multithreaded programs.
UR - http://www.scopus.com/inward/record.url?scp=38049044081&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38049044081&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-74309-5_31
DO - 10.1007/978-3-540-74309-5_31
M3 - Conference contribution
AN - SCOPUS:38049044081
SN - 9783540743088
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 327
EP - 338
BT - Advances in Computer Systems Architecture - 12th Asia-Pacific Conference, ACSAC 2007, Proceedings
PB - Springer
T2 - 12th Asia-Pacific Computer Systems Architecture Conference, ACSAC 2007
Y2 - 23 August 2007 through 25 August 2007
ER -