TY - GEN
T1 - Cohesion
T2 - 37th International Symposium on Computer Architecture, ISCA 2010
AU - Kelm, John H.
AU - Johnson, Daniel R.
AU - Tuohy, William
AU - Lumetta, Steven Sam
AU - Patel, Sanjay Jeram
PY - 2010
Y1 - 2010
N2 - Two broad classes of memory models are available today: models with hardware cache coherence, used in conventional chip multiprocessors, and models that rely upon software to manage coherence, found in compute accelerators. In some systems, both types of models are supported using disjoint address spaces and/or physical memories. In this paper we present Cohesion, a hybrid memory model that enables fine-grained temporal reassignment of data between hardware-managed and software-managed coherence domains, allowing a system to support both. Cohesion can be used to dynamically adapt to the sharing needs of both applications and runtimes. Cohesion requires neither copy operations nor multiple address spaces. Cohesion offers the benefits of reduced message traffic and on-die directory overhead when software-managed coherence can be used and the advantages of hardware coherence for cases in which software-managed coherence is impractical. We demonstrate our protocol using a hierarchical, cached 1024-core processor with a single address space that supports both software-enforced coherence and a directory-based hardware coherence protocol. Relative to an optimistic, hardware-coherent baseline, a realizable Cohesion design achieves competitive performance with a 2× reduction in message traffic, 2.1× reduction in directory utilization, and greater robustness to on-die directory capacity.
AB - Two broad classes of memory models are available today: models with hardware cache coherence, used in conventional chip multiprocessors, and models that rely upon software to manage coherence, found in compute accelerators. In some systems, both types of models are supported using disjoint address spaces and/or physical memories. In this paper we present Cohesion, a hybrid memory model that enables fine-grained temporal reassignment of data between hardware-managed and software-managed coherence domains, allowing a system to support both. Cohesion can be used to dynamically adapt to the sharing needs of both applications and runtimes. Cohesion requires neither copy operations nor multiple address spaces. Cohesion offers the benefits of reduced message traffic and on-die directory overhead when software-managed coherence can be used and the advantages of hardware coherence for cases in which software-managed coherence is impractical. We demonstrate our protocol using a hierarchical, cached 1024-core processor with a single address space that supports both software-enforced coherence and a directory-based hardware coherence protocol. Relative to an optimistic, hardware-coherent baseline, a realizable Cohesion design achieves competitive performance with a 2× reduction in message traffic, 2.1× reduction in directory utilization, and greater robustness to on-die directory capacity.
KW - Accelerator
KW - Cache coherence
KW - Computer architecture
UR - http://www.scopus.com/inward/record.url?scp=77954960337&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77954960337&partnerID=8YFLogxK
U2 - 10.1145/1815961.1816019
DO - 10.1145/1815961.1816019
M3 - Conference contribution
AN - SCOPUS:77954960337
SN - 9781450300520
T3 - Proceedings - International Symposium on Computer Architecture
SP - 429
EP - 440
BT - ISCA 2010 - The 37th Annual International Symposium on Computer Architecture, Conference Proceedings
Y2 - 19 June 2010 through 23 June 2010
ER -