A Case for Fine-grain Coherence Specialization in Heterogeneous Systems

Johnathan Alsop, Weon Taek Na, Matthew D. Sinclair, Samuel Grayson, Sarita Adve

Research output: Contribution to journalArticlepeer-review

Abstract

Hardware specialization is becoming a key enabler of energy-efficient performance. Future systems will be increasingly heterogeneous, integrating multiple specialized and programmable accelerators, each with different memory demands. Traditionally, communication between accelerators has been inefficient, typically orchestrated through explicit DMA transfers between different address spaces. More recently, industry has proposed unified coherent memory which enables implicit data movement and more data reuse, but often these interfaces limit the coherence flexibility available to heterogeneous systems. This paper demonstrates the benefits of fine-grained coherence specialization for heterogeneous systems. We propose an architecture that enables low-complexity independent specialization of each individual coherence request in heterogeneous workloads by building upon a simple and flexible baseline coherence interface, Spandex. We then describe how to optimize individual memory requests to improve cache reuse and performance-critical memory latency in emerging heterogeneous workloads. Collectively, our techniques enable significant gains, reducing execution time by up to 61% or network traffic by up to 99% while adding minimal complexity to the Spandex protocol.

Original languageEnglish (US)
Article number41
JournalACM Transactions on Architecture and Code Optimization
Volume19
Issue number3
DOIs
StatePublished - Aug 22 2022

Keywords

  • GPUs
  • Shared memory systems
  • caches
  • coherence

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'A Case for Fine-grain Coherence Specialization in Heterogeneous Systems'. Together they form a unique fingerprint.

Cite this