TY - JOUR
T1 - POSTER
T2 - 25th International Conference on Parallel Architectures and Compilation Techniques, PACT 2016
AU - Srivastava, Prakalp
AU - Kotsifakou, Maria
AU - Sinclair, Matthew D.
AU - Komuravelli, Rakesh
AU - Adve, Vikram
AU - Adve, Sarita
N1 - Funding Information:
This work was supported in part by the National Science Foundation (grant number CCF 13-02641), and by C-FAR, SRC STARnet Center sponsored by MARCO and DARPA.
PY - 2016
Y1 - 2016
N2 - Programming heterogeneous parallel systems can be extremely complex because a single system may include multiple different parallelism models, instruction sets, and memory hierarchies, and different systems use different combinations of these features. We propose a carefully designed parallel abstraction of heterogeneous hardware-a hierarchical dataflow graph with shared memory and vector instructions-that is able to capture the parallelism in a wide range of popular parallel hardware. We use this abstraction, which we call hVISC, to define a Virtual Instruction Set Architecture (ISA) that aims to address both functional portability and performance portability across heterogeneous systems. hVISC is more general than existing virtual instruction sets such as PTX, HSAIL and SPIR, e.g., it can capture both streaming parallelism and general dataflow parallelism.
AB - Programming heterogeneous parallel systems can be extremely complex because a single system may include multiple different parallelism models, instruction sets, and memory hierarchies, and different systems use different combinations of these features. We propose a carefully designed parallel abstraction of heterogeneous hardware-a hierarchical dataflow graph with shared memory and vector instructions-that is able to capture the parallelism in a wide range of popular parallel hardware. We use this abstraction, which we call hVISC, to define a Virtual Instruction Set Architecture (ISA) that aims to address both functional portability and performance portability across heterogeneous systems. hVISC is more general than existing virtual instruction sets such as PTX, HSAIL and SPIR, e.g., it can capture both streaming parallelism and general dataflow parallelism.
KW - CUDA
KW - GPU
KW - heterogeneous systems
KW - multicore
KW - opencl
KW - parallel compiler ir
KW - ptx
KW - virtual instruction set
UR - http://www.scopus.com/inward/record.url?scp=84989282504&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84989282504&partnerID=8YFLogxK
U2 - 10.1145/2967938.2976039
DO - 10.1145/2967938.2976039
M3 - Conference article
AN - SCOPUS:84989282504
SP - 443
EP - 445
JO - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
JF - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
SN - 1089-795X
Y2 - 11 September 2016 through 15 September 2016
ER -