TY - GEN
T1 - Composing low-overhead scheduling strategies for improving performance of scientific applications
AU - Kale, Vivek
AU - Gropp, William D.
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Many different sources of overheads impact the efficiency of a scheduling strategy applied to a parallel loop within a scientific application. In prior work, we handled these overheads using multiple loop scheduling strategies, with each scheduling strategy focusing on mitigating a subset of the overheads. However, mitigating the impact of one source of overhead can lead to an increase in the impact of another source of overhead, and vice versa. In this work, we show that in order to improve efficiency of loop scheduling strategies, one must adapt the loop scheduling strategies so as to handle all overheads simultaneously. To show this, we describe a composition of our existing loop scheduling strategies, and experiment with the composed scheduling strategy on standard benchmarks and application codes. Applying the composed scheduling strategy to three MPI+OpenMP scientific codes run on a cluster of SMPs improves performance an average of 31% over standard OpenMP static scheduling.
AB - Many different sources of overheads impact the efficiency of a scheduling strategy applied to a parallel loop within a scientific application. In prior work, we handled these overheads using multiple loop scheduling strategies, with each scheduling strategy focusing on mitigating a subset of the overheads. However, mitigating the impact of one source of overhead can lead to an increase in the impact of another source of overhead, and vice versa. In this work, we show that in order to improve efficiency of loop scheduling strategies, one must adapt the loop scheduling strategies so as to handle all overheads simultaneously. To show this, we describe a composition of our existing loop scheduling strategies, and experiment with the composed scheduling strategy on standard benchmarks and application codes. Applying the composed scheduling strategy to three MPI+OpenMP scientific codes run on a cluster of SMPs improves performance an average of 31% over standard OpenMP static scheduling.
UR - http://www.scopus.com/inward/record.url?scp=84951752257&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84951752257&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-24595-9_2
DO - 10.1007/978-3-319-24595-9_2
M3 - Conference contribution
AN - SCOPUS:84951752257
SN - 9783319245942
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 18
EP - 29
BT - OpenMP
A2 - Terboven, Christian
A2 - Reble, Pablo
A2 - Müller, Matthias S.
A2 - Chapman, Barbara M.
A2 - de Supinski, Bronis R.
PB - Springer
T2 - 11th International Workshop on OpenMP, IWOMP 2015
Y2 - 1 October 2015 through 2 October 2015
ER -