TY - GEN
T1 - Improving memory scheduling via processor-side load criticality information
AU - Ghose, Saugata
AU - Lee, Hyodong
AU - Martínez, José F.
N1 - Copyright:
Copyright 2013 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - We hypothesize that performing processor-side analysis of load instructions, and providing this pre-digested information to memory schedulers judiciously, can increase the sophistication of memory decisions while maintaining a lean memory controller that can take scheduling actions quickly. This is increasingly important as DRAM frequencies continue to increase relative to processor speed. In this paper we propose one such mechanism, pairing up a processor-side load criticality predictor with a lean memory controller that prioritizes load requests based on ranking information supplied from the processor side. Using a sophisticated multi-core simulator that includes a detailed quad-channel DDR3 DRAM model, we demonstrate that this mechanism can improve performance significantly on a CMP, with minimal overhead and virtually no changes to the processor itself. We show that our design compares favorably to several state-of-the-art schedulers.
AB - We hypothesize that performing processor-side analysis of load instructions, and providing this pre-digested information to memory schedulers judiciously, can increase the sophistication of memory decisions while maintaining a lean memory controller that can take scheduling actions quickly. This is increasingly important as DRAM frequencies continue to increase relative to processor speed. In this paper we propose one such mechanism, pairing up a processor-side load criticality predictor with a lean memory controller that prioritizes load requests based on ranking information supplied from the processor side. Using a sophisticated multi-core simulator that includes a detailed quad-channel DDR3 DRAM model, we demonstrate that this mechanism can improve performance significantly on a CMP, with minimal overhead and virtually no changes to the processor itself. We show that our design compares favorably to several state-of-the-art schedulers.
UR - http://www.scopus.com/inward/record.url?scp=84881131887&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84881131887&partnerID=8YFLogxK
U2 - 10.1145/2485922.2485930
DO - 10.1145/2485922.2485930
M3 - Conference contribution
AN - SCOPUS:84881131887
SN - 9781450320795
T3 - Proceedings - International Symposium on Computer Architecture
SP - 84
EP - 95
BT - ISCA 2013 - 40th Annual International Symposium on Computer Architecture, Conference Proceedings
T2 - 40th Annual International Symposium on Computer Architecture, ISCA 2013
Y2 - 23 June 2013 through 27 June 2013
ER -