Analyzing throughput of GPGPUs exploiting within-die core-to-core frequency variation

Jungseob Lee, Paritosh Pratap Ajgaonkar, Nam Sung Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The state-of-the-art general-purpose graphic processing units (GPGPUs) can offer very high computational throughput for general-purpose, highly-parallel applications using hundreds of available on-chip cores. Meanwhile, as technology is scaled down below 65nm, each core's maximum frequency varies significantly due to increasing within-die variations. This, in turn, diminishes the throughput improvement of GPGPUs through technology scaling because the maximum frequency is often limited by the slowest core. In this paper, we investigate two techniques that can mitigate the impact of frequency variations on GPGPU's throughput: 1) running each core at its maximum frequency independently and 2) disabling the slowest cores. Both can maximize GPGPU's frequency at either the individual core or entire processor level. Our experimental results using a GPGPU simulator and a 32nm technology show that the first and second techniques can improve the throughput of compute- and problem-size-bounded applications by up to 32% and 19%, respectively.

Original languageEnglish (US)
Title of host publicationISPASS 2011 - IEEE International Symposium on Performance Analysis of Systems and Software
Pages237-246
Number of pages10
DOIs
StatePublished - May 30 2011
Externally publishedYes
EventIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2011 - Austin, TX, United States
Duration: Apr 10 2011Apr 12 2011

Publication series

NameISPASS 2011 - IEEE International Symposium on Performance Analysis of Systems and Software

Other

OtherIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2011
Country/TerritoryUnited States
CityAustin, TX
Period4/10/114/12/11

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Analyzing throughput of GPGPUs exploiting within-die core-to-core frequency variation'. Together they form a unique fingerprint.

Cite this