Decomposing the load - Store queue by function for power reduction and scalability

Lee Baugh, Craig Zilles

Research output: Contribution to journalArticlepeer-review


Because they are based on large, content-addressable memories, load-store queues (LSQs) present implementation challenges in superscalar processors. In this paper, we propose an alternate LSQ organization that separates the time-critical forwarding functionality from the process of checking that loads received their correct values. Two main techniques are exploited: First, the store-forwarding logic is accessed only by those loads and stores that are likely to be involved in forwarding, and second, the checking structure is banked by address. The result of these techniques is that the LSQ can be implemented by a collection of small, low-bandwidth structures yielding an estimated three to five times reduction in LSQ dynamic power.

Original languageEnglish (US)
Pages (from-to)287-297
Number of pages11
JournalIBM Journal of Research and Development
Issue number2-3
StatePublished - 2006

ASJC Scopus subject areas

  • General Computer Science


Dive into the research topics of 'Decomposing the load - Store queue by function for power reduction and scalability'. Together they form a unique fingerprint.

Cite this