TY - GEN
T1 - DeLorean
T2 - ISCA 2008, 35th International Symposium on Computer Architecture
AU - Montesinos, Pablo
AU - Ceze, Luis
AU - Torrellas, Josep
PY - 2008
Y1 - 2008
N2 - Support for deterministic replay of multithreaded execution can greatly help in finding concurrency bugs. For highest effectiveness, replay schemes should (i) record at production-run speed, (ii) keep their logging requirements minute, and (iii) replay at a speed similar to that of the initial execution. In this paper, we propose a new substrate for deterministic replay that provides substantial advances along these axes. In our proposal, processors execute blocks of instructions atomic-ally, as in transactional memory or speculative multithreading, and the system only needs to record the commit order of these blocks. We call our scheme DeLorean. Our results show that DeLorean records execution at a speed similar to that of Release Consistency (RC) execution, and replays at about 82% of its speed. In contrast, most current schemes only record at the speed of Sequential Consistency (SC) execution. Moreover, DeLorean only needs 7.5% of the log size needed by a state-of-the-art scheme. Finally, DeLorean can be configured to need only 0.6% of the log size of the state-of-the-art scheme at the cost of recording at 86% of RC's execution speed - still faster than SC. In this configuration, the log of an 8-processor 5-GHz machine is estimated to be only about 20GB per day.
AB - Support for deterministic replay of multithreaded execution can greatly help in finding concurrency bugs. For highest effectiveness, replay schemes should (i) record at production-run speed, (ii) keep their logging requirements minute, and (iii) replay at a speed similar to that of the initial execution. In this paper, we propose a new substrate for deterministic replay that provides substantial advances along these axes. In our proposal, processors execute blocks of instructions atomic-ally, as in transactional memory or speculative multithreading, and the system only needs to record the commit order of these blocks. We call our scheme DeLorean. Our results show that DeLorean records execution at a speed similar to that of Release Consistency (RC) execution, and replays at about 82% of its speed. In contrast, most current schemes only record at the speed of Sequential Consistency (SC) execution. Moreover, DeLorean only needs 7.5% of the log size needed by a state-of-the-art scheme. Finally, DeLorean can be configured to need only 0.6% of the log size of the state-of-the-art scheme at the cost of recording at 86% of RC's execution speed - still faster than SC. In this configuration, the log of an 8-processor 5-GHz machine is estimated to be only about 20GB per day.
UR - http://www.scopus.com/inward/record.url?scp=52649147142&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=52649147142&partnerID=8YFLogxK
U2 - 10.1109/ISCA.2008.36
DO - 10.1109/ISCA.2008.36
M3 - Conference contribution
AN - SCOPUS:52649147142
SN - 9780769531748
T3 - Proceedings - International Symposium on Computer Architecture
SP - 289
EP - 300
BT - ISCA 2008, Proceedings - 35th International Symposium on Computer Architecture
Y2 - 21 June 2008 through 25 June 2008
ER -