Data forwarding in scalable shared-memory multiprocessors

D. A. Koufaty, X. Chen, D. K. Poulsen, J. Torrellas

Research output: Chapter in Book/Report/Conference proceedingConference contribution


Scalable shared-memory multiprocessors are often slowed down by long-latency memory accesses. One way to cope with this problem is to use data forwarding to overlap memory accesses with computation. With data forwarding, when a processor produces a datum, in addition to updating its cache, it sends a copy of the datum to the caches of the processors that the compiler identified as consumers of it. As a result, when the consumer processors access the datum, they find it in their caches. This paper addresses two main issues. First, it presents a framework for a compiler algorithm for forwarding. Second, using address traces, it evaluates the performance impact of different levels of support for forwarding. Our simulations of a 32-processor machine show that, on average, a slightly-optimistic support for forwarding speeds up five applications by 50% for large caches and 30% for small caches. For large caches, most read sharing misses can be eliminated, while for small caches, forwarding rarely increases the number of conflict misses. Overall, support for forwarding in shared-memory multiprocessors promises to deliver good application speedups.

Original languageEnglish (US)
Title of host publicationProceedings of the 9th International Conference on Supercomputing, ICS 1995
PublisherAssociation for Computing Machinery
Number of pages10
ISBN (Electronic)0897917286
StatePublished - Jul 3 1995
Event9th International Conference on Supercomputing, ICS 1995 - Barcelona, Spain
Duration: Jul 3 1995Jul 7 1995

Publication series

NameProceedings of the International Conference on Supercomputing
VolumePart F129361


Other9th International Conference on Supercomputing, ICS 1995

ASJC Scopus subject areas

  • Computer Science(all)


Dive into the research topics of 'Data forwarding in scalable shared-memory multiprocessors'. Together they form a unique fingerprint.

Cite this