TY - GEN
T1 - Toward runtime power management of exascale networks by on/off control of links
AU - Totoni, Ehsan
AU - Jain, Nikhil
AU - Kale, Laxmikant V.
PY - 2013
Y1 - 2013
N2 - Higher radix networks, such as high-dimensional tori and multi-level directly connected networks, are being used for supercomputers as they become larger but need lower diameter. These networks have more resources (e.g. links) in order to provide good performance for a range of applications. We observe that a sizeable fraction of the links in the interconnect are never used or underutilized during execution of common parallel applications. Thus, in order to save power, we propose addition of hardware support for on/off control of links in software and their management using adaptive runtime systems. We study the effectiveness of our approach using real applications (NAMD, MILC), and application benchmarks (NAS Parallel Benchmarks, Jacobi). They are simulated on representative topologies such as 6-D Torus and Dragonfly (e.g. IBM PERCS, Cray Aries). For common applications, our approach can save up to 16% of total machine's power and energy, without any performance penalty.
AB - Higher radix networks, such as high-dimensional tori and multi-level directly connected networks, are being used for supercomputers as they become larger but need lower diameter. These networks have more resources (e.g. links) in order to provide good performance for a range of applications. We observe that a sizeable fraction of the links in the interconnect are never used or underutilized during execution of common parallel applications. Thus, in order to save power, we propose addition of hardware support for on/off control of links in software and their management using adaptive runtime systems. We study the effectiveness of our approach using real applications (NAMD, MILC), and application benchmarks (NAS Parallel Benchmarks, Jacobi). They are simulated on representative topologies such as 6-D Torus and Dragonfly (e.g. IBM PERCS, Cray Aries). For common applications, our approach can save up to 16% of total machine's power and energy, without any performance penalty.
UR - http://www.scopus.com/inward/record.url?scp=84899730602&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84899730602&partnerID=8YFLogxK
U2 - 10.1109/IPDPSW.2013.191
DO - 10.1109/IPDPSW.2013.191
M3 - Conference contribution
AN - SCOPUS:84899730602
SN - 9780769549798
T3 - Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013
SP - 915
EP - 922
BT - Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013
PB - IEEE Computer Society
T2 - 2013 IEEE 37th Annual Computer Software and Applications Conference, COMPSAC 2013
Y2 - 22 July 2013 through 26 July 2013
ER -