ScalableBulk: Scalable cache coherence for atomic blocks in a lazy environment

Xuehai Qian, Wonsun Ahn, Josep Torrellas

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Recently-proposed architectures that continuously operate on atomic blocks of instructions (also called chunks) can boost the programmability and performance of shared-memory multiprocessing. However, they must support chunk operations very efficiently. In particular, in lazy conflict-detection environments, it is key that they provide scalable chunk commits. Unfortunately, current proposals typically fail to enable maximum overlap of conflict-free chunk commits. This paper presents a novel directory-based protocol that enables highly-overlapped, scalable chunk commits. The protocol, called Scal-ableBulk, builds on the previously-proposed BulkSC protocol. It introduces three general hardware primitives for scalable commit: preventing access to a set of directory entries, grouping directory modules, and initiating the commit optimistically. Our results with SPLASH-2 and PARSEC codes with up to 64 processors show that ScalableBulk enables highly-overlapped chunk commits and delivers scalable performance. Unlike previously-proposed schemes, it removes practically all commit stalls.

Original languageEnglish (US)
Title of host publicationProceedings - 43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2010
Pages447-458
Number of pages12
DOIs
StatePublished - 2010
Event43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2010 - Atlanta, GA, United States
Duration: Dec 4 2010Dec 8 2010

Publication series

NameProceedings of the Annual International Symposium on Microarchitecture, MICRO
ISSN (Print)1072-4451

Other

Other43rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2010
Country/TerritoryUnited States
CityAtlanta, GA
Period12/4/1012/8/10

ASJC Scopus subject areas

  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'ScalableBulk: Scalable cache coherence for atomic blocks in a lazy environment'. Together they form a unique fingerprint.

Cite this