Cloud-DNN: An open framework for mapping DNN models to cloud FPGAS

Yao Chen, Jiong He, Xiaofan Zhang, Cong Hao, Deming Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The efficacy and effectiveness of Convolutional Neural Networks (CNNs) have been proven in a wide range of machine learning applications. However, the high computational complexity of CNNs presents a critical challenge towards their broader adoption in real-time and power-efficient scenarios. FPGAs are poised to take a significant role for high-performance and energy-efficient computation of CNNs for both mobile (e.g., UAVs, self-driving cars, and IoT devices) and cloud computing domains. However, implementing an effective CNN system onto FPGAs efficiently remains problematic. The current cloud-based FPGAs with unique design constraints and architectural characteristics further increase the challenges. To address these challenges, we propose a novel open-source automated tool chain called Cloud-DNN. Our tool chain takes trained CNN models specified in Caffe as input, performs a set of transformations, and maps the model to a cloud-based FPGA. Cloud-DNN can significantly improve the overall design productivity of CNNs on FPGAs while satisfying the emergent computational requirements. Our design provides an alternative solution compared to other cloud-based options (e.g., GPUs or TPUs) while offering flexible, and high performance DNN inferences. The unique features of Cloud-DNN include the optimizations with cloud-platform characteristics and the support of easier and streamlined implementation. Experimental results demonstrate up to 104.55× performance improvement when compared to CPU implementation and comparable usability, flexibility, and strong quality compared to other state-of-the-art DNN inference implementations on standalone FPGAs.

Original languageEnglish (US)
Title of host publicationFPGA 2019 - Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
PublisherAssociation for Computing Machinery
Pages73-82
Number of pages10
ISBN (Electronic)9781450361378
DOIs
StatePublished - Feb 20 2019
Event2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, FPGA 2019 - Seaside, United States
Duration: Feb 24 2019Feb 26 2019

Publication series

NameFPGA 2019 - Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays

Conference

Conference2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, FPGA 2019
Country/TerritoryUnited States
CitySeaside
Period2/24/192/26/19

Keywords

  • Cloud Computing
  • DNN Accelerator
  • FPGA
  • High-Level Synthesis

ASJC Scopus subject areas

  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Cloud-DNN: An open framework for mapping DNN models to cloud FPGAS'. Together they form a unique fingerprint.

Cite this