TY - GEN
T1 - Collective communication on architectures that support simultaneous communication over multiple links
AU - Chan, Ernie
AU - Van De Geijn, Robert
AU - Gropp, William
AU - Thakur, Rajeev
PY - 2006
Y1 - 2006
N2 - Traditional collective communication algorithms are designed with the assumption that a node can communicate with only one other node at a time. On new parallel architectures such as the IBM Blue Gene/L, a node can communicate with multiple nodes simultaneously. We have redesigned and reimplemented many of the MPI collective communication algorithms to take advantage of this ability to send simultaneously, including broadcast, reduce(-to-one), scatter, gather, allgather, reduce-scatter, and allreduce. We show that these new algorithms have lower expected costs than the previously known lower bounds based on old models of parallel computation. Results are included comparing their performance to the default implementations in IBM's MPI.
AB - Traditional collective communication algorithms are designed with the assumption that a node can communicate with only one other node at a time. On new parallel architectures such as the IBM Blue Gene/L, a node can communicate with multiple nodes simultaneously. We have redesigned and reimplemented many of the MPI collective communication algorithms to take advantage of this ability to send simultaneously, including broadcast, reduce(-to-one), scatter, gather, allgather, reduce-scatter, and allreduce. We show that these new algorithms have lower expected costs than the previously known lower bounds based on old models of parallel computation. Results are included comparing their performance to the default implementations in IBM's MPI.
KW - Algorithms
KW - Performance
UR - http://www.scopus.com/inward/record.url?scp=33751022826&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33751022826&partnerID=8YFLogxK
U2 - 10.1145/1122971.1122975
DO - 10.1145/1122971.1122975
M3 - Conference contribution
AN - SCOPUS:33751022826
SN - 1595931899
SN - 9781595931894
T3 - Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP
SP - 2
EP - 11
BT - Proceedings of the 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06
PB - Association for Computing Machinery
T2 - 2006 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP'06
Y2 - 29 March 2006 through 31 March 2006
ER -