TY - GEN
T1 - Chasing away RAts
T2 - 44th Annual International Symposium on Computer Architecture - ISCA 2017
AU - Sinclair, Matthew D.
AU - Alsop, Johnathan
AU - Adve, Sarita V.
N1 - Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/6/24
Y1 - 2017/6/24
N2 - An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous systems. In a widely adopted approach, the memory model guarantees sequential consistency (SC) as long as programmers obey certain rules. The popular data-race-free-0 (DRF0) model exemplifes this SC-centric approach by requiring programmers to avoid data races. Recent industry models, however, have extended such SC-centric models to incorporate relaxed atomics. These extensions can improve performance, but are diffcult to specify formally and use correctly. This work addresses the impact of relaxed atomics on consistency models for heterogeneous systems in two ways. First, we introduce a new model, Data-Race-Free-Relaxed (DRFrlx), that extends DRF0 to provide SC-centric semantics for the common use cases of relaxed atomics. Second, we evaluate the performance of relaxed atomics in CPU-GPU systems for these use cases. We fnd mixed results-for most cases, relaxed atomics provide only a small beneft in execution time, but for some cases, they help signifcantly (e.g., up to 51% for DRFrlx over DRF0).
AB - An unambiguous and easy-to-understand memory consistency model is crucial for ensuring correct synchronization and guiding future design of heterogeneous systems. In a widely adopted approach, the memory model guarantees sequential consistency (SC) as long as programmers obey certain rules. The popular data-race-free-0 (DRF0) model exemplifes this SC-centric approach by requiring programmers to avoid data races. Recent industry models, however, have extended such SC-centric models to incorporate relaxed atomics. These extensions can improve performance, but are diffcult to specify formally and use correctly. This work addresses the impact of relaxed atomics on consistency models for heterogeneous systems in two ways. First, we introduce a new model, Data-Race-Free-Relaxed (DRFrlx), that extends DRF0 to provide SC-centric semantics for the common use cases of relaxed atomics. Second, we evaluate the performance of relaxed atomics in CPU-GPU systems for these use cases. We fnd mixed results-for most cases, relaxed atomics provide only a small beneft in execution time, but for some cases, they help signifcantly (e.g., up to 51% for DRFrlx over DRF0).
KW - Data-race-free Models
KW - GPGPU
KW - Memory consistency
KW - Relaxed atomics
UR - http://www.scopus.com/inward/record.url?scp=85025580671&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85025580671&partnerID=8YFLogxK
U2 - 10.1145/3079856.3080206
DO - 10.1145/3079856.3080206
M3 - Conference contribution
AN - SCOPUS:85025580671
T3 - Proceedings - International Symposium on Computer Architecture
SP - 161
EP - 174
BT - ISCA 2017 - 44th Annual International Symposium on Computer Architecture - Conference Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 June 2017 through 28 June 2017
ER -