Abstract
Manually optimizing the tradeoffs between accuracy, performance and energy for resource-intensive applications with flexible accuracy or precision requirements is extremely difficult. We present ApproxTuner, an automatic framework for accuracy-aware optimization of tensor-based applications while requiring only high-level end-to-end quality specifications. ApproxTuner implements and manages approximations in algorithms, system software, and hardware. The key contribution in ApproxTuner is a novel three-phase approach to approximation-tuning that consists of development-time, install-time, and run-time phases. Our approach decouples tuning of hardware-independent and hardware-specific approximations, thus providing retargetability across devices. To enable efficient autotuning of approximation choices, we present a novel accuracy-aware tuning technique called predictive approximation-tuning, which significantly speeds up autotuning by analytically predicting the accuracy impacts of approximations. We evaluate ApproxTuner across 10 convolutional neural networks (CNNs) and a combined CNN and image processing benchmark. For the evaluated CNNs, using only hardware-independent approximation choices we achieve a mean speedup of 2.1x (max 2.7x) on a GPU, and 1.3x mean speedup (max 1.9x) on the CPU, while staying within 1 percentage point of inference accuracy loss. For two different accuracy-prediction models, ApproxTuner speeds up tuning by 12.8x and 20.4x compared to conventional empirical tuning while achieving comparable benefits.
Original language | English (US) |
---|---|
Title of host publication | PPoPP 2021 - Proceedings of the 2021 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming |
Publisher | Association for Computing Machinery |
Pages | 262-277 |
Number of pages | 16 |
ISBN (Electronic) | 9781450382946 |
DOIs | |
State | Published - Feb 17 2021 |
Event | 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2021 - Virtual, Online, Korea, Republic of Duration: Feb 27 2021 → Mar 3 2021 |
Publication series
Name | Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP |
---|
Conference
Conference | 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2021 |
---|---|
Country/Territory | Korea, Republic of |
City | Virtual, Online |
Period | 2/27/21 → 3/3/21 |
Keywords
- approximate computing
- compilers
- deep neural networks
- heterogeneous systems
ASJC Scopus subject areas
- Software
Fingerprint
Dive into the research topics of 'ApproxTuner: A compiler and runtime system for adaptive approximations'. Together they form a unique fingerprint.Datasets
-
ApproxTuner DNN Models
Zhao, Y. (Creator), Sharif, H. (Creator), Adve, V. S. (Creator) & Misailovic, S. (Creator), University of Illinois Urbana-Champaign, Mar 23 2021
DOI: 10.13012/B2IDB-6565690_V1
Dataset