TY - GEN
T1 - An adaptive algorithm selection framework
AU - Yu, Hao
AU - Zhang, Dongmin
AU - Rauchwerger, Lawrence
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2004
Y1 - 2004
N2 - Irregular and dynamic memory reference patterns can cause performance variations for low level algorithms in general and for parallel algorithms in particular. We present an adaptive algorithm selection framework which can collect and interpret the inputs of a particular instance of a parallel algorithm and select the best performing one from a an existing library. In this paper present the dynamic selection of parallel reduction algorithms. First we introduce a set of high-level parameters that can characterize different parallel reduction algorithms. Then we describe an off-line, systematic process to generate predictive models which can be used for run-time algorithm selection. Our experiments show that our framework: (a) selects the most appropriate algorithms in 85% of the cases studied, (b) overall delievers 98% of the optimal performance, (c) adoptively selects the best algorithms for dynamic phases of a running program (resulting in performance improvements otherwise not possible), and (d) adapts to the underlying machine architecture (tested on IBM Regatta and HP V-Class systems).
AB - Irregular and dynamic memory reference patterns can cause performance variations for low level algorithms in general and for parallel algorithms in particular. We present an adaptive algorithm selection framework which can collect and interpret the inputs of a particular instance of a parallel algorithm and select the best performing one from a an existing library. In this paper present the dynamic selection of parallel reduction algorithms. First we introduce a set of high-level parameters that can characterize different parallel reduction algorithms. Then we describe an off-line, systematic process to generate predictive models which can be used for run-time algorithm selection. Our experiments show that our framework: (a) selects the most appropriate algorithms in 85% of the cases studied, (b) overall delievers 98% of the optimal performance, (c) adoptively selects the best algorithms for dynamic phases of a running program (resulting in performance improvements otherwise not possible), and (d) adapts to the underlying machine architecture (tested on IBM Regatta and HP V-Class systems).
UR - http://www.scopus.com/inward/record.url?scp=10444220869&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=10444220869&partnerID=8YFLogxK
U2 - 10.1109/PACT.2004.1342561
DO - 10.1109/PACT.2004.1342561
M3 - Conference contribution
AN - SCOPUS:10444220869
SN - 0769522297
T3 - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
SP - 278
EP - 289
BT - Proceedings - 13th International Conference on Parallel Architectures and Compilation Techniques (PACT 2004)
T2 - Proceedings - 13th International Conference on Parallel Architectures and Compilation Techniques (PACT 2004)
Y2 - 29 September 2004 through 3 October 2004
ER -