TY - GEN
T1 - Core architecture optimization for heterogeneous chip multiprocessors
AU - Kumar, Rakesh
AU - Tullsen, Dean M.
AU - Jouppi, Norman P.
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2006
Y1 - 2006
N2 - Previous studies have demonstrated the advantages of single-ISA heterogeneous multi-core architectures for power and performance. However, none of those studies examined how to design such a processor; instead, they started with an assumed combination of pre-existing cores. This work assumes the flexibility to design a multi-core architecture from the ground up and seeks to address the following question: what should be the characteristics of the cores for a heterogeneous multi-processor for the highest area or power efficiency? The study is done for varying degrees of thread-level parallelism and for different area and power budgets. The most efficient chip multiprocessors are shown to be heterogeneous, with each core customized to a different subset of application characteristics - no single core is necessarily well suited to all applications. The performance ordering of cores on such processors is different for different applications; there is only a partial ordering among cores in terms of resources and complexity. This methodology produces performance gains as high as 40%. The performance improvements come with the added cost of customization.
AB - Previous studies have demonstrated the advantages of single-ISA heterogeneous multi-core architectures for power and performance. However, none of those studies examined how to design such a processor; instead, they started with an assumed combination of pre-existing cores. This work assumes the flexibility to design a multi-core architecture from the ground up and seeks to address the following question: what should be the characteristics of the cores for a heterogeneous multi-processor for the highest area or power efficiency? The study is done for varying degrees of thread-level parallelism and for different area and power budgets. The most efficient chip multiprocessors are shown to be heterogeneous, with each core customized to a different subset of application characteristics - no single core is necessarily well suited to all applications. The performance ordering of cores on such processors is different for different applications; there is only a partial ordering among cores in terms of resources and complexity. This methodology produces performance gains as high as 40%. The performance improvements come with the added cost of customization.
KW - Computer architecture
KW - Heterogeneous chip multiprocessors
KW - Multi-core architectures
UR - http://www.scopus.com/inward/record.url?scp=34247174509&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34247174509&partnerID=8YFLogxK
U2 - 10.1145/1152154.1152162
DO - 10.1145/1152154.1152162
M3 - Conference contribution
AN - SCOPUS:34247174509
SN - 159593264X
SN - 9781595932648
T3 - Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
SP - 23
EP - 32
BT - PACT 2006 - Proceedings of the Fifteenth International Conference on Parallel Architectures and Compilation Techniques
T2 - PACT 2006 - 15th International Conference on Parallel Architectures and Compilation Techniques
Y2 - 16 September 2006 through 20 September 2006
ER -