TY - GEN
T1 - Predicting memory-access cost based on data-access patterns
AU - Byna, Surendra
AU - Sun, Xian He
AU - Gropp, William D
AU - Thakur, Rajeev
N1 - Copyright:
Copyright 2012 Elsevier B.V., All rights reserved.
PY - 2004
Y1 - 2004
N2 - Improving memory performance at software level is more effective in reducing the rapidly expanding gap between processor and memory performance. Loop transformations (e.g. loop unrolling, loop tiling) and array restructuring optimizations improve the memory performance by increasing the locality of memory accesses. To find the best optimization parameters at runtime, we need a fast and simple analytical model to predict the memory access cost. Most of the existing models are complex and impractical to be integrated in the runtime tuning systems. In this paper, we propose a simple, fast and reasonably accurate model that is capable of predicting the memory access cost based on a wide range of data access patterns that appear in many scientific applications.
AB - Improving memory performance at software level is more effective in reducing the rapidly expanding gap between processor and memory performance. Loop transformations (e.g. loop unrolling, loop tiling) and array restructuring optimizations improve the memory performance by increasing the locality of memory accesses. To find the best optimization parameters at runtime, we need a fast and simple analytical model to predict the memory access cost. Most of the existing models are complex and impractical to be integrated in the runtime tuning systems. In this paper, we propose a simple, fast and reasonably accurate model that is capable of predicting the memory access cost based on a wide range of data access patterns that appear in many scientific applications.
UR - http://www.scopus.com/inward/record.url?scp=20444490760&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=20444490760&partnerID=8YFLogxK
U2 - 10.1109/CLUSTR.2004.1392630
DO - 10.1109/CLUSTR.2004.1392630
M3 - Conference contribution
AN - SCOPUS:20444490760
SN - 0780386949
T3 - Proceedings - IEEE International Conference on Cluster Computing, ICCC
SP - 327
EP - 336
BT - 2004 IEEE International Conference on Cluster Computing, ICCC 2004
T2 - 2004 IEEE International Conference on Cluster Computing, ICCC 2004
Y2 - 20 September 2004 through 23 September 2004
ER -