TY - GEN
T1 - Hierarchical overlapped tiling
AU - Zhou, Xing
AU - Giacalone, Jean Pierre
AU - Garzarán, María Jesús
AU - Kuhn, Robert H.
AU - Ni, Yang
AU - Padua, David
PY - 2012
Y1 - 2012
N2 - This paper introduces hierarchical overlapped tiling, a transformation that applies loop tiling and fusion to conventional loops. Overlapped tiling is a useful transformation to reduce communication overhead, but it may also generate a significant amount of redundant computation. Hierarchical overlapped tiling performs overlapped tiling hierarchically to balance communication overhead and redundant computation, and thus has the potential to provide better performance. In this paper, we describe the hierarchical overlapped tiling optimization and its implementation in an OpenCL compiler. We also evaluate the effectiveness of this optimization using 8 programs that implement different forms of stencil computation. Our results show that hierarchical overlapped tiling achieves an average 37% speedup over traditional tiling on a 32-core workstation.
AB - This paper introduces hierarchical overlapped tiling, a transformation that applies loop tiling and fusion to conventional loops. Overlapped tiling is a useful transformation to reduce communication overhead, but it may also generate a significant amount of redundant computation. Hierarchical overlapped tiling performs overlapped tiling hierarchically to balance communication overhead and redundant computation, and thus has the potential to provide better performance. In this paper, we describe the hierarchical overlapped tiling optimization and its implementation in an OpenCL compiler. We also evaluate the effectiveness of this optimization using 8 programs that implement different forms of stencil computation. Our results show that hierarchical overlapped tiling achieves an average 37% speedup over traditional tiling on a 32-core workstation.
KW - Compiler optimization
KW - Loop tiling and fusion
KW - Stencil computation
UR - http://www.scopus.com/inward/record.url?scp=84863438547&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863438547&partnerID=8YFLogxK
U2 - 10.1145/2259016.2259044
DO - 10.1145/2259016.2259044
M3 - Conference contribution
AN - SCOPUS:84863438547
SN - 9781605586359
T3 - Proceedings - International Symposium on Code Generation and Optimization, CGO 2012
SP - 207
EP - 218
BT - Proceedings - International Symposium on Code Generation and Optimization, CGO 2012
T2 - 10th International Symposium on Code Generation and Optimization, CGO 2012
Y2 - 31 March 2012 through 4 April 2012
ER -