Profiling and reducing micro-architecture bottlenecks at the hardware level

Francis B. Moreira, Marco A.Z. Alves, Matthias Diener, Philippe O.A. Navaux, Israel Koren

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Most mechanisms in current superscalar processors use instruction granularity information for speculation, such as branch predictors or prefetchers. However, many of these characteristics can be obtained at the basic block level, increasing the amount of code that can be covered while requiring less space to store the data. Furthermore, the code can be profiled more accurately and provide a higher variety of information by analyzing different instruction types inside a block. Because of these advantages, block-level analysis can offer more opportunities for mechanisms that use this information. For example, it is possible to integrate information about branch prediction and memory accesses to provide precise information for speculative mechanisms, increasing accuracy and performance. We propose a Block-Level Architecture Profiler (BLAP), an online mechanism that profiles bottlenecks at the micro architectural level, such as delinquent memory loads, hard-topredict branches and contention for functional units. BLAP works at the basic block level, providing information that can be used to reduce the impact of these bottlenecks. A prefetch dropping mechanism and a memory controller policy were developed to use the profiled information provided by BLAP. Together, these mechanisms are able to improve performance by up to 17.39% (3.90% on average). Our technique showed average gains of 13.14% when evaluated under high memory pressure due to highly aggressive prefetch.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 26th International Symposium
PublisherIEEE Computer Society
Pages222-229
Number of pages8
ISBN (Electronic)9781479969043
DOIs
StatePublished - Dec 1 2014
Externally publishedYes
Event26th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2014 - Paris, France
Duration: Oct 22 2014Oct 24 2014

Publication series

NameProceedings - Symposium on Computer Architecture and High Performance Computing
ISSN (Print)1550-6533

Other

Other26th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2014
Country/TerritoryFrance
CityParis
Period10/22/1410/24/14

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'Profiling and reducing micro-architecture bottlenecks at the hardware level'. Together they form a unique fingerprint.

Cite this