Characterization and improvement of load/store cache-based prefetching

Pablo Ibanez, Victor Vinals, Jose L. Briz, Maria J. Garzaran

Research output: Contribution to conferencePaperpeer-review

Abstract

A common mechanism to perform hardware-based prefetching for regular accesses to arrays and chained lists is based on a Load/Store cache (LSC). An LSC associates the address of a 1d/st instruction with its individual behavior at every entry. We show that the implementation cost of the LSC is rather high, and that using it is inefficient. We aim to decrease the cost of the LSC but not its performance. This may be done preventing useless instructions from being stored in the LSC. We propose eliminating those instructions that never miss, and those that follow a sequential pattern. This may be carried out by inserting a 1d/st instruction in the LSC whenever it misses in the data cache (on-miss insertion), and issuing sequential prefetching simultaneously. After having analyzed the performance of this proposal through a cycle-by-cycle simulation over a set of 25 benchmarks selected from SPEC95, SPEC92 and Perfect Club, we conclude that an LSC of only 8 entries, which combines on-miss insertion and sequential prefetching, performs better than a conventional LSC of 512 entries. We think that the low cost of the proposal makes it worth being taken into account for the development of future microprocessors.

Original languageEnglish (US)
Pages369-376
Number of pages8
DOIs
StatePublished - 1998
Externally publishedYes
EventProceedings of the 1998 International Conference on Supercomputing - Melbourne, Aust
Duration: Jul 13 1998Jul 17 1998

Other

OtherProceedings of the 1998 International Conference on Supercomputing
CityMelbourne, Aust
Period7/13/987/17/98

ASJC Scopus subject areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Characterization and improvement of load/store cache-based prefetching'. Together they form a unique fingerprint.

Cite this