TY - GEN
T1 - Generic topology mapping strategies for large-scale parallel architectures
AU - Hoefler, Torsten
AU - Snir, Marc
PY - 2011
Y1 - 2011
N2 - The steadily increasing number of nodes in high-performance computing systems and the technology and power constraints lead to sparse network topologies. Efficient mapping of application communication patterns to the network topology gains importance as systems grow to petascale and beyond. Such mapping is supported in parallel programming frameworks such as MPI, but is often not well implemented. We show that the topology mapping problem is NP-complete and analyze and compare different practical topology mapping heuristics. We demonstrate an efficient and fast new heuristic which is based on graph similarity and show its utility with application communication patterns on real topologies. Our mapping strategies support heterogeneous networks and show significant reduction of congestion on torus, fat-tree, and the PERCS network topologies, for irregular communication patterns. We also demonstrate that the benefit of topology mapping grows with the network size and show how our algorithms can be used in a practical setting to optimize communication performance. Our efficient topology mapping strategies are shown to reduce network congestion by up to 80%, reduce average dilation by up to 50%, and improve benchmarked communication performance by 18%.
AB - The steadily increasing number of nodes in high-performance computing systems and the technology and power constraints lead to sparse network topologies. Efficient mapping of application communication patterns to the network topology gains importance as systems grow to petascale and beyond. Such mapping is supported in parallel programming frameworks such as MPI, but is often not well implemented. We show that the topology mapping problem is NP-complete and analyze and compare different practical topology mapping heuristics. We demonstrate an efficient and fast new heuristic which is based on graph similarity and show its utility with application communication patterns on real topologies. Our mapping strategies support heterogeneous networks and show significant reduction of congestion on torus, fat-tree, and the PERCS network topologies, for irregular communication patterns. We also demonstrate that the benefit of topology mapping grows with the network size and show how our algorithms can be used in a practical setting to optimize communication performance. Our efficient topology mapping strategies are shown to reduce network congestion by up to 80%, reduce average dilation by up to 50%, and improve benchmarked communication performance by 18%.
KW - mpi graph topologies
KW - topology mapping
UR - http://www.scopus.com/inward/record.url?scp=79959609021&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79959609021&partnerID=8YFLogxK
U2 - 10.1145/1995896.1995909
DO - 10.1145/1995896.1995909
M3 - Conference contribution
AN - SCOPUS:79959609021
SN - 9781450301022
T3 - Proceedings of the International Conference on Supercomputing
SP - 75
EP - 84
BT - ICS'11 - Proceedings of the 2011 ACM International Conference on Supercomputing
T2 - 25th ACM International Conference on Supercomputing, ICS 2011
Y2 - 31 May 2011 through 4 June 2011
ER -