TY - JOUR
T1 - Architectural support for parallel reductions in scalable shared-memory multiprocessors
AU - Garzarán, María Jesús
AU - Prvulovic, Milos
AU - Zhang, Ye
AU - Jula, Alin
AU - Yu, Hao
AU - Rauchwerger, Lawrence
AU - Torrellas, Josep
N1 - Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2001
Y1 - 2001
AB - Reductions are important and time-consuming operations in many scientific codes. Effective parallelization of reductions is a critical transformation for loop parallelization, especially for sparse, dynamic applications. Unfortunately, conventional reduction parallelization algorithms are not scalable. In this paper, we present new architectural support that significantly speeds up parallel reduction and makes it scalable in shared-memory multiprocessors. The required architectural changes are mostly confined to the directory controllers. Experimental results based on simulations show that the proposed support is very effective. While conventional software-only reduction parallelization delivers average speedups of only 2.7 for 16 processors, our scheme delivers average speedups of 7.6.
UR - http://www.scopus.com/inward/record.url?scp=0035177716&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0035177716&partnerID=8YFLogxK
U2 - 10.1109/PACT.2001.953304
DO - 10.1109/PACT.2001.953304
M3 - Article
AN - SCOPUS:0035177716
SN - 1089-795X
SP - 243
EP - 254
JO - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
JF - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
ER -