TY - GEN
T1 - Deep compressive offloading
T2 - 18th ACM Conference on Embedded Networked Sensor Systems, SenSys 2020
AU - Yao, Shuochao
AU - Li, Jinyang
AU - Liu, Dongxin
AU - Wang, Tianshi
AU - Liu, Shengzhong
AU - Shao, Huajie
AU - Abdelzaher, Tarek
N1 - We sincerely thank for the invaluable comments from anonymous shepherding and reviewing. Research reported in this paper was sponsored in part by DARPA award W911NF-17-C-0099, DTRA award HDTRA1-18-1-0026, and the Army Research Laboratory under Cooperative Agreements W911NF-17-2-0196.
PY - 2020/11/16
Y1 - 2020/11/16
N2 - With recent advances, neural networks have become a crucial building block in intelligent IoT systems and sensing applications. However, the excessive computational demand remains a serious impediment to their deployments on low-end IoT devices. With the emergence of edge computing, offloading grows into a promising technique to circumvent end-device limitations. However, transferring data between local and edge devices takes up a large proportion of time in existing offloading frameworks, creating a bottleneck for low-latency intelligent services. In this work, we propose a general framework, called deep compressive offloading. By integrating compressive sensing theory and deep learning, our framework can encode data for offloading into tiny sizes with negligible overhead on local devices and decode the data on the edge server, while offering theoretical guarantees on perfect reconstruction and lossless inference. By trading edge computing resources for data transmission time, our design can significantly reduce offloading latency with almost no accuracy loss. We build a deep compressive offloading system to serve state-of-the-art computer vision and speech recognition services. With comprehensive evaluations, our system can consistently reduce end-to-end latency by 2X to 4X with 1% accuracy loss, compared to state-of-the-art neural network offloading systems. In conditions of limited network bandwidth or intensive background traffic, our system can further speed up the neural network inference by up to 35X 1.
AB - With recent advances, neural networks have become a crucial building block in intelligent IoT systems and sensing applications. However, the excessive computational demand remains a serious impediment to their deployments on low-end IoT devices. With the emergence of edge computing, offloading grows into a promising technique to circumvent end-device limitations. However, transferring data between local and edge devices takes up a large proportion of time in existing offloading frameworks, creating a bottleneck for low-latency intelligent services. In this work, we propose a general framework, called deep compressive offloading. By integrating compressive sensing theory and deep learning, our framework can encode data for offloading into tiny sizes with negligible overhead on local devices and decode the data on the edge server, while offering theoretical guarantees on perfect reconstruction and lossless inference. By trading edge computing resources for data transmission time, our design can significantly reduce offloading latency with almost no accuracy loss. We build a deep compressive offloading system to serve state-of-the-art computer vision and speech recognition services. With comprehensive evaluations, our system can consistently reduce end-to-end latency by 2X to 4X with 1% accuracy loss, compared to state-of-the-art neural network offloading systems. In conditions of limited network bandwidth or intensive background traffic, our system can further speed up the neural network inference by up to 35X 1.
KW - compressive offloading
KW - compressive sensing
KW - deep learning
KW - edge computing
KW - internet of things
KW - offloading
UR - https://www.scopus.com/pages/publications/85097567691
UR - https://www.scopus.com/pages/publications/85097567691#tab=citedBy
U2 - 10.1145/3384419.3430898
DO - 10.1145/3384419.3430898
M3 - Conference contribution
AN - SCOPUS:85097567691
T3 - SenSys 2020 - Proceedings of the 2020 18th ACM Conference on Embedded Networked Sensor Systems
SP - 476
EP - 488
BT - SenSys 2020 - Proceedings of the 2020 18th ACM Conference on Embedded Networked Sensor Systems
PB - Association for Computing Machinery
Y2 - 16 November 2020 through 19 November 2020
ER -