TY - JOUR
T1 - Adaptive partitioning for irregular applications on heterogeneous CPU-GPU chips
AU - Vilches, Antonio
AU - Asenjo, Rafael
AU - Navarro, Angeles
AU - Corbera, Francisco
AU - Gran, Rubén
AU - Garzarán, María
N1 - Publisher Copyright:
© The Authors. Published by Elsevier B.V.
PY - 2015
Y1 - 2015
N2 - Commodity processors are comprised of several CPU cores and one integrated GPU. To fully exploit this type of architectures, one needs to automatically determine how to partition the workload between both devices. This is specially challenging for irregular workloads, where each iteration's work is data dependent and shows control and memory divergence. In this paper, we present a novel adaptive partitioning strategy specially designed for irregular applications running on heterogeneous CPU-GPU chips. The main novelty of this work is that the size of the workload assigned to the GPU and CPU adapts dynamically to maximize the GPU and CPU utilization while balancing the workload among the devices. Our experimental results on an Intel Haswell architecture using a set of irregular benchmarks show that our approach outperforms exhaustive static and adaptive state-of-the-art approaches in terms of performance and energy consumption.
AB - Commodity processors are comprised of several CPU cores and one integrated GPU. To fully exploit this type of architectures, one needs to automatically determine how to partition the workload between both devices. This is specially challenging for irregular workloads, where each iteration's work is data dependent and shows control and memory divergence. In this paper, we present a novel adaptive partitioning strategy specially designed for irregular applications running on heterogeneous CPU-GPU chips. The main novelty of this work is that the size of the workload assigned to the GPU and CPU adapts dynamically to maximize the GPU and CPU utilization while balancing the workload among the devices. Our experimental results on an Intel Haswell architecture using a set of irregular benchmarks show that our approach outperforms exhaustive static and adaptive state-of-the-art approaches in terms of performance and energy consumption.
KW - Adaptive partitioning
KW - Dynamic scheduling
KW - Heterogeneous CPU-GPU chips
KW - Parallel for
UR - http://www.scopus.com/inward/record.url?scp=84939160371&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84939160371&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2015.05.213
DO - 10.1016/j.procs.2015.05.213
M3 - Conference article
AN - SCOPUS:84939160371
SN - 1877-0509
VL - 51
SP - 140
EP - 149
JO - Procedia Computer Science
JF - Procedia Computer Science
IS - 1
T2 - International Conference on Computational Science, ICCS 2002
Y2 - 21 April 2002 through 24 April 2002
ER -