TY - GEN
T1 - Revealing the performance of MPI RMA implementations
AU - Gropp, William D.
AU - Thakur, Rajeev
PY - 2007
Y1 - 2007
N2 - The MPI remote-memory access (RMA) operations provide a different programming model from the regular MPI-1 point-to-point operations. This model is particularly appropriate for cases where there are multiple communication events for each synchronization and where the target memory locations are known by the source processes. In this paper, we describe a benchmark designed to illustrate the performance of RMA with multiple RMA operations for each synchronization, as compared with point-to-point communication. We measured the performance of this benchmark on several platforms (SGI Altix, Sun Fire, IBM SMP, Linux cluster) and MPI implementations (SGI, Sun, IBM, MPICH2, Open MPI). We also investigated the effectiveness of the various optimization options specified by the MPI standard. Our results show that MPI RMA can provide substantially higher performance than point-topoint communication on some platforms, such as SGI Altix and Sun Fire. The results also show that many opportunities still exist for performance improvements in the implementation of MPI RMA.
AB - The MPI remote-memory access (RMA) operations provide a different programming model from the regular MPI-1 point-to-point operations. This model is particularly appropriate for cases where there are multiple communication events for each synchronization and where the target memory locations are known by the source processes. In this paper, we describe a benchmark designed to illustrate the performance of RMA with multiple RMA operations for each synchronization, as compared with point-to-point communication. We measured the performance of this benchmark on several platforms (SGI Altix, Sun Fire, IBM SMP, Linux cluster) and MPI implementations (SGI, Sun, IBM, MPICH2, Open MPI). We also investigated the effectiveness of the various optimization options specified by the MPI standard. Our results show that MPI RMA can provide substantially higher performance than point-topoint communication on some platforms, such as SGI Altix and Sun Fire. The results also show that many opportunities still exist for performance improvements in the implementation of MPI RMA.
UR - http://www.scopus.com/inward/record.url?scp=38449113999&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38449113999&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-75416-9_38
DO - 10.1007/978-3-540-75416-9_38
M3 - Conference contribution
AN - SCOPUS:38449113999
SN - 9783540754152
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 272
EP - 280
BT - Recent Advances in Parallel Virtual Machine and Message Passing Interface - 14th European PVM/MPI Users' Group Meeting, Proceedings
PB - Springer
T2 - 14th European PVM/MPI Users' Group Meeting on Parallel Virtual Machine and Message Passing Interface
Y2 - 30 September 2007 through 3 October 2007
ER -