Trireme: Exploration of Hierarchical Multi-level Parallelism for Hardware Acceleration

Georgios Zacharopoulos, Adel Ejjeh, Ying Jing, En Yu Yang, Tianyu Jia, Iulian Brumar, Jeremy Intan, Muhammad Huzaifa, Sarita Adve, Vikram Adve, Gu Yeon Wei, David Brooks

Research output: Contribution to journalArticlepeer-review

Abstract

The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution, including loop level, task level, and pipeline parallelism. To assist the design process and expose every possible level of parallelism, we present Trireme, a fully automated tool-chain that explores multiple levels of parallelism and produces domain-specific accelerator designs and configurations that maximize performance, given an area budget. FPGA SoCs were used as target platforms, and Catapult HLS [7] was used to synthesize RTL using a commercial 12 nm FinFET technology. Experiments on demanding benchmarks from the XR domain revealed a speedup of up to 20×, as well as a speedup of up to 37× for smaller applications, compared to software-only implementations.

Original languageEnglish (US)
Article number53
JournalACM Transactions on Embedded Computing Systems
Volume22
Issue number3
DOIs
StatePublished - Apr 20 2023

Keywords

  • ASICs
  • Accelerators
  • compiler techniques and optimizations
  • design tools
  • heterogeneous systems parallelism

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Trireme: Exploration of Hierarchical Multi-level Parallelism for Hardware Acceleration'. Together they form a unique fingerprint.

Cite this