TY - GEN
T1 - Toward efficient support for multithreaded MPI communication
AU - Balaji, Pavan
AU - Buntinas, Darius
AU - Goodell, David
AU - Gropp, William
AU - Thakur, Rajeev
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2008
Y1 - 2008
N2 - To make the most effective use of parallel machines that are being built out of increasingly large multicore chips, researchers are exploring the use of programming models comprising a mixture of MPI and threads. Such hybrid models require efficient support from an MPI implementation for MPI messages sent from multiple threads simultaneously. In this paper, we explore the issues involved in designing such an implementation. We present four approaches to building a fully thread-safe MPI implementation, with decreasing levels of critical-section granularity (from coarse-grain locks to fine-grain locks to lock-free operations) and correspondingly increasing levels of complexity. We describe how we have structured our implementation to support all four approaches and enable one to be selected at build time. We present performance results with a message-rate benchmark to demonstrate the performance implications of the different approaches.
AB - To make the most effective use of parallel machines that are being built out of increasingly large multicore chips, researchers are exploring the use of programming models comprising a mixture of MPI and threads. Such hybrid models require efficient support from an MPI implementation for MPI messages sent from multiple threads simultaneously. In this paper, we explore the issues involved in designing such an implementation. We present four approaches to building a fully thread-safe MPI implementation, with decreasing levels of critical-section granularity (from coarse-grain locks to fine-grain locks to lock-free operations) and correspondingly increasing levels of complexity. We describe how we have structured our implementation to support all four approaches and enable one to be selected at build time. We present performance results with a message-rate benchmark to demonstrate the performance implications of the different approaches.
UR - http://www.scopus.com/inward/record.url?scp=56449090423&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56449090423&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-87475-1_20
DO - 10.1007/978-3-540-87475-1_20
M3 - Conference contribution
AN - SCOPUS:56449090423
SN - 3540874747
SN - 9783540874744
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 120
EP - 129
BT - Recent Advances in Parallel Virtual Machine and Message Passing Interface - 15th European PVM/MPI Users' Group Meeting, Proceedings
T2 - 15th European PVM/MPI Users' Group Meeting, EuroPVM/MPI 2008
Y2 - 7 September 2008 through 10 September 2008
ER -