Abstract
In this paper we address the important problem of instruction fetch for future wide issue superscalar processors. Our approach focuses on understanding the interaction between software and hardware techniques targeting an increase in the instruction fetch bandwidth. That is the objective, for instance, of the Hardware Trace Cache (HTC). We design a profile based code reordering technique which targets a maximization of the sequentiality of instructions, while still trying to minimize instruction cache misses. We call our software approach, Software Trace Cache (STC). We evaluate our software approach, and then compare it with the HTC and the combination of both techniques. Our results show that for large codes with few loops and deterministic execution sequences like databases and some SPEC-INT codes, the STC offers similar, or better, results than a HTC. Moreover, when combining the software and hardware approaches, we obtain encouraging results: the STC and a small HTC offer similar performance to a much larger HTC alone.
Original language | English (US) |
---|---|
Pages | 119-126 |
Number of pages | 8 |
DOIs | |
State | Published - 1999 |
Externally published | Yes |
Event | Proceedings of the 1999 13th ACM International Conference on Supercomputing, ICS'99 - Rhodes, Greece Duration: Jun 20 1999 → Jun 25 1999 |
Other
Other | Proceedings of the 1999 13th ACM International Conference on Supercomputing, ICS'99 |
---|---|
City | Rhodes, Greece |
Period | 6/20/99 → 6/25/99 |
ASJC Scopus subject areas
- Computer Science(all)