WinoCNN: Kernel sharing Winograd systolic array for efficient convolutional neural network acceleration on FPGAs

Xinheng Liu, Yao Chen, Cong Hao, Ashutosh Dhar, Deming Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The combination of Winograd's algorithm and systolic array architecture has demonstrated the capability of improving DSP efficiency in accelerating convolutional neural networks (CNNs) on FPGA platforms. However, handling arbitrary convolution kernel sizes in FPGA-based Winograd processing elements and supporting efficient data access remain underexplored. In this work, we are the first to propose an optimized Winograd processing element (WinoPE), which can naturally support multiple convolution kernel sizes with the same amount of computing resources and maintains high runtime DSP efficiency. Using the proposed WinoPE, we construct a highly efficient systolic array accelerator, termed WinoCNN. We also propose a dedicated memory subsystem to optimize the data access. Based on the accelerator architecture, we build accurate resource and performance modeling to explore optimal accelerator configurations under different resource constraints. We implement our proposed accelerator on multiple FPGAs, which outperforms the state-of-the-art designs in terms of both throughput and DSP efficiency. Our implementation achieves DSP efficiency up to 1.33 GOPS/DSP and throughput up to 3.1 TOPS with the Xilinx ZCU102 FPGA. These are 29.1% and 20.0% better than the best solutions reported previously, respectively.

Original languageEnglish (US)
Title of host publicationProceedings - 32nd IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages258-265
Number of pages8
ISBN (Electronic)9781665427012
DOIs
StatePublished - Jul 2021
Event32nd IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2021 - Virtual, Online, United States
Duration: Jul 7 2021Jul 8 2021

Publication series

NameProceedings of the International Conference on Application-Specific Systems, Architectures and Processors
Volume2021-text
ISSN (Print)1063-6862

Conference

Conference32nd IEEE International Conference on Application-Specific Systems, Architectures and Processors, ASAP 2021
Country/TerritoryUnited States
CityVirtual, Online
Period7/7/217/8/21

Keywords

  • CNN
  • DSP efficiency
  • FPGA
  • Systolic array
  • Winograd algorithm

ASJC Scopus subject areas

  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'WinoCNN: Kernel sharing Winograd systolic array for efficient convolutional neural network acceleration on FPGAs'. Together they form a unique fingerprint.

Cite this