AutoAI2C: An Automated Hardware Generator for DNN Acceleration On Both FPGA and ASIC

Yongan Zhang, Xiaofan Zhang, Pengfei Xu, Yang Zhao, Cong Hao, Deming Chen, Yingyan Lin

Research output: Contribution to journalArticlepeer-review

Abstract

Recent advancements in deep neural networks (DNNs) and the slowing of Moore's law have made domain-specific hardware accelerators for DNNs (i.e., DNN chips) a promising means for enabling more extensive DNN applications. However, designing DNN chips is challenging due to: 1) the vast and nonstandardized design space and 2) different DNN models' varying performance preferences regarding hardware micro-architecture and dataflows. Therefore, designing a DNN chip often takes a large team of interdisciplinary experts months to years. To enable flexible and efficient DNN chip design, we propose AutoAI2C: a DNN chip generator that can automatically generate both FPGA- and ASIC-based DNN accelerator implementation (i.e., synthesizable hardware and deployment code) with optimized algorithm-to-hardware mapping, given the DNN model specification from mainstream machine learning frameworks (e.g., PyTorch). Specifically, AutoAI2C consists of two major components: 1) a Chip Predictor, which can efficiently and reliably predict a DNN accelerator's energy, latency, and resource consumption using the proposed graph-based intermediate accelerator representation and 2) a Chip Builder, which can generate and optimize DNN accelerator designs by automatically exploring the design space based on targeting metrics and the Chip Predictor's performance feedback. Extensive experiments show that our Chip Predictor's predictions differ by <10% from real-measured ones. Furthermore, AutoAI2C generated accelerators can achieve performance comparable to or better than state-of-the-art accelerators, achieving up to a 2.12\times $ throughput improvements or 2.4\times $ latency reduction with the same level of hardware resource usage, or reducing energy consumption by up to 1.6\times $ , when running the same DNN workloads.

Original languageEnglish (US)
Pages (from-to)3143-3156
Number of pages14
JournalIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Volume43
Issue number10
DOIs
StatePublished - 2024

Keywords

  • AI chips
  • Design automation
  • Genetic algorithms
  • Neural network hardware

ASJC Scopus subject areas

  • Software
  • Computer Graphics and Computer-Aided Design
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'AutoAI2C: An Automated Hardware Generator for DNN Acceleration On Both FPGA and ASIC'. Together they form a unique fingerprint.

Cite this