TY - GEN
T1 - RelaxReplay
T2 - 19th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2014
AU - Honarmand, Nima
AU - Torrellas, Josep
PY - 2014
Y1 - 2014
N2 - Record and Deterministic Replay (RnR) of multithreaded programs on relaxed-consistency multiprocessors has been a long-standing problem. While there are designs that work for Total Store Ordering (TSO), finding a general solution that is able to record the access reordering allowed by any relaxed-consistency model has proved challenging. This paper presents the first complete solution for hardware-assisted memory race recording that works for any relaxed-consistency model of current processors. With the scheme, called RelaxReplay, we can build an RnR system for any relaxed-consistency model and coherence protocol. RelaxReplay's core innovation is a new way of capturing memory access reordering. Each memory instruction goes through a post-completion in-order counting step that detects any reordering, and efficiently records it. We evaluate RelaxReplay with simulations of an 8-core release-consistent multicore running SPLASH-2 programs. We observe that RelaxReplay induces negligible overhead during recording. In addition, the average size of the log produced is comparable to the log sizes reported for existing solutions, and still very small compared to the memory bandwidth of modern machines. Finally, deterministic replay is efficient and needs minimal hardware support.
AB - Record and Deterministic Replay (RnR) of multithreaded programs on relaxed-consistency multiprocessors has been a long-standing problem. While there are designs that work for Total Store Ordering (TSO), finding a general solution that is able to record the access reordering allowed by any relaxed-consistency model has proved challenging. This paper presents the first complete solution for hardware-assisted memory race recording that works for any relaxed-consistency model of current processors. With the scheme, called RelaxReplay, we can build an RnR system for any relaxed-consistency model and coherence protocol. RelaxReplay's core innovation is a new way of capturing memory access reordering. Each memory instruction goes through a post-completion in-order counting step that detects any reordering, and efficiently records it. We evaluate RelaxReplay with simulations of an 8-core release-consistent multicore running SPLASH-2 programs. We observe that RelaxReplay induces negligible overhead during recording. In addition, the average size of the log produced is comparable to the log sizes reported for existing solutions, and still very small compared to the memory bandwidth of modern machines. Finally, deterministic replay is efficient and needs minimal hardware support.
KW - Memory race recording
KW - Record and deterministic replay
KW - Relaxed consisten
UR - http://www.scopus.com/inward/record.url?scp=84897757780&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84897757780&partnerID=8YFLogxK
U2 - 10.1145/2541940.2541979
DO - 10.1145/2541940.2541979
M3 - Conference contribution
AN - SCOPUS:84897757780
SN - 9781450323055
T3 - International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS
SP - 223
EP - 237
BT - ASPLOS 2014 - 19th International Conference on Architectural Support for Programming Languages and Operating Systems
Y2 - 1 March 2014 through 5 March 2014
ER -