TY - JOUR
T1 - Decomposing the load-store queue by function for power reduction and scalability
AU - Baugh, Lee
AU - Zilles, Craig
PY - 2006
Y1 - 2006
AB - Because they are based on large, content-addressable memories, load-store queues (LSQs) present implementation challenges in superscalar processors. In this paper, we propose an alternate LSQ organization that separates the time-critical forwarding functionality from the process of checking that loads received their correct values. Two main techniques are exploited: First, the store-forwarding logic is accessed only by those loads and stores that are likely to be involved in forwarding, and second, the checking structure is banked by address. The result of these techniques is that the LSQ can be implemented by a collection of small, low-bandwidth structures yielding an estimated three to five times reduction in LSQ dynamic power.
UR - http://www.scopus.com/inward/record.url?scp=33646061342&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33646061342&partnerID=8YFLogxK
U2 - 10.1147/rd.502.0287
DO - 10.1147/rd.502.0287
M3 - Article
AN - SCOPUS:33646061342
SN - 0018-8646
VL - 50
SP - 287
EP - 297
JO - IBM Journal of Research and Development
JF - IBM Journal of Research and Development
IS - 2-3
ER -