TY - GEN
T1 - Programming heterogeneous systems
AU - Kunzman, David M.
AU - Kalé, Laxmikant V.
PY - 2011
Y1 - 2011
N2 - Various specialized hardware designs, such as Cell, GPGPUs, and MIC, have gained traction as alternative hardware designs capable of delivering higher flop rates than conventional designs. However, a drawback of these accelerators is that they simultaneously increase programmer burden in terms of code complexity and decrease portability by requiring hardware specific code to be interleaved throughout application code. The structure of the application code itself typically requires modification when targeting accelerators. Further, balancing the application workload across the cores becomes problematic, especially if a given computation must be split across a mixture of core types with variable performance characteristics. Our research aims to address the complications that arise in heterogeneous systems by understanding how the application build process and underlying runtime system can assist the programmer in developing parallel programs that target such platforms. We are developing a unified programming model that can be used for all cores, host and accelerator alike. We discuss the modifications we have made to the runtime system, along with discussing future modifications. We have demonstrated a simple molecular dynamics (MD) program executing on a mixture of x86 and Cell processors without requiring hardware specific code within the application code.
AB - Various specialized hardware designs, such as Cell, GPGPUs, and MIC, have gained traction as alternative hardware designs capable of delivering higher flop rates than conventional designs. However, a drawback of these accelerators is that they simultaneously increase programmer burden in terms of code complexity and decrease portability by requiring hardware specific code to be interleaved throughout application code. The structure of the application code itself typically requires modification when targeting accelerators. Further, balancing the application workload across the cores becomes problematic, especially if a given computation must be split across a mixture of core types with variable performance characteristics. Our research aims to address the complications that arise in heterogeneous systems by understanding how the application build process and underlying runtime system can assist the programmer in developing parallel programs that target such platforms. We are developing a unified programming model that can be used for all cores, host and accelerator alike. We discuss the modifications we have made to the runtime system, along with discussing future modifications. We have demonstrated a simple molecular dynamics (MD) program executing on a mixture of x86 and Cell processors without requiring hardware specific code within the application code.
UR - https://www.scopus.com/pages/publications/83455220794
UR - https://www.scopus.com/inward/citedby.url?scp=83455220794&partnerID=8YFLogxK
U2 - 10.1109/IPDPS.2011.377
DO - 10.1109/IPDPS.2011.377
M3 - Conference contribution
AN - SCOPUS:83455220794
SN - 9780769543857
T3 - IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum
SP - 2061
EP - 2064
BT - 2011 IEEE International Symposium on Parallel and Distributed Processing, Workshops and Phd Forum, IPDPSW 2011
T2 - 25th IEEE International Parallel and Distributed Processing Symposium, Workshops and Phd Forum, IPDPSW 2011
Y2 - 16 May 2011 through 20 May 2011
ER -