TY - GEN
T1 - Quantifying the effectiveness of load balance algorithms
AU - Pearce, Olga
AU - Gamblin, Todd
AU - De Supinski, Bronis R.
AU - Schulz, Martin
AU - Amato, Nancy M.
PY - 2012
Y1 - 2012
N2 - Load balance is critical for performance in large parallel applications. An imbalance on today's fastest supercomputers can force hundreds of thousands of cores to idle, and on future exascale machines this cost will increase by over a factor of a thousand. Improving load balance requires a detailed understanding of the amount of computational load per process and an application's simulated domain, but no existing metrics sufficiently account for both factors. Current load balance mechanisms are often integrated into applications and make implicit assumptions about the load. Some strategies place the burden of providing accurate load information, including the decision on when to balance, on the application. Existing application-independent mechanisms simply measure the application load without any knowledge of application elements, which limits them to identifying imbalance without correcting it. Our novel load model couples abstract application information with scalable measurements to derive accurate and actionable load metrics. Using these metrics, we develop a cost model for correcting load imbalance. Our model enables comparisons of the effectiveness of load balancing algorithms in any specific imbalance scenario. Our model correctly selects the algorithm that achieves the lowest runtime in up to 96% of the cases, and can achieve a 19% gain over selecting a single balancing algorithm for all cases.
AB - Load balance is critical for performance in large parallel applications. An imbalance on today's fastest supercomputers can force hundreds of thousands of cores to idle, and on future exascale machines this cost will increase by over a factor of a thousand. Improving load balance requires a detailed understanding of the amount of computational load per process and an application's simulated domain, but no existing metrics sufficiently account for both factors. Current load balance mechanisms are often integrated into applications and make implicit assumptions about the load. Some strategies place the burden of providing accurate load information, including the decision on when to balance, on the application. Existing application-independent mechanisms simply measure the application load without any knowledge of application elements, which limits them to identifying imbalance without correcting it. Our novel load model couples abstract application information with scalable measurements to derive accurate and actionable load metrics. Using these metrics, we develop a cost model for correcting load imbalance. Our model enables comparisons of the effectiveness of load balancing algorithms in any specific imbalance scenario. Our model correctly selects the algorithm that achieves the lowest runtime in up to 96% of the cases, and can achieve a 19% gain over selecting a single balancing algorithm for all cases.
KW - Framework
KW - Load balance
KW - Modeling
KW - Performance
KW - Simulation
UR - http://www.scopus.com/inward/record.url?scp=84864053189&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864053189&partnerID=8YFLogxK
U2 - 10.1145/2304576.2304601
DO - 10.1145/2304576.2304601
M3 - Conference contribution
AN - SCOPUS:84864053189
SN - 9781450313162
T3 - Proceedings of the International Conference on Supercomputing
SP - 185
EP - 194
BT - ICS'12 - Proceedings of the 2012 ACM International Conference on Supercomputing
T2 - 26th ACM International Conference on Supercomputing, ICS'12
Y2 - 25 June 2012 through 29 June 2012
ER -