TY - GEN
T1 - Underprovisioned GPUs
T2 - 32nd International Conference on Computer Communications and Networks, ICCCN 2023
AU - Hu, Yigong
AU - Gokarn, Ila
AU - Liu, Shengzhong
AU - Misra, Archan
AU - Abdelzaher, Tarek
N1 - ACKNOWLEDGEMENT This work was sponsored in part by ARL W911NF-17-2-0196, NSF CNS 20-38817, IBM (IIDAI), the Boeing Company, and the National Research Foundation, Singapore under NRF-NRFI05-2019-0007. The views and conclusions contained in this document are those of the author(s) and should not be interpreted as representing the official policies of the U.S. DEVCOM Army Research Laboratory, the U.S. government, or the National Research Foundation, Singapore.
PY - 2023
Y1 - 2023
N2 - Recent work suggests that computing resources, such as GPUs in real-time edge-based perception systems, need not have sufficient capacity to keep up with the input frame rates of all input devices (e.g., cameras) at their full-frame resolution. Rather, they can be under-provisioned because only parts of any given frame need to be inspected (i.e., paid attention to). This paper derives an attention allocation policy, called canvas-based attention scheduling that decides which parts of each frame of each device to inspect, and a corresponding schedulability condition that relates the spatiotemporal properties of surrounding objects to the ability of the edge-based perception subsystem to keep up with the state of the environment in real-time. It provides a quantitative estimate of adequate computing capacity for the expected perception workload. We implement a canvas-based attention scheduler for an object detection application and perform an empirical comparative study based on actual GPU hardware and surveillance videos. Results show that canvas-based attention scheduling keeps up with the environment while using a much smaller GPU capacity, compared with prior approaches.
AB - Recent work suggests that computing resources, such as GPUs in real-time edge-based perception systems, need not have sufficient capacity to keep up with the input frame rates of all input devices (e.g., cameras) at their full-frame resolution. Rather, they can be under-provisioned because only parts of any given frame need to be inspected (i.e., paid attention to). This paper derives an attention allocation policy, called canvas-based attention scheduling that decides which parts of each frame of each device to inspect, and a corresponding schedulability condition that relates the spatiotemporal properties of surrounding objects to the ability of the edge-based perception subsystem to keep up with the state of the environment in real-time. It provides a quantitative estimate of adequate computing capacity for the expected perception workload. We implement a canvas-based attention scheduler for an object detection application and perform an empirical comparative study based on actual GPU hardware and surveillance videos. Results show that canvas-based attention scheduling keeps up with the environment while using a much smaller GPU capacity, compared with prior approaches.
UR - http://www.scopus.com/inward/record.url?scp=85173584866&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173584866&partnerID=8YFLogxK
U2 - 10.1109/ICCCN58024.2023.10230127
DO - 10.1109/ICCCN58024.2023.10230127
M3 - Conference contribution
AN - SCOPUS:85173584866
T3 - Proceedings - International Conference on Computer Communications and Networks, ICCCN
BT - ICCCN 2023 - 2023 32nd International Conference on Computer Communications and Networks
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 July 2023 through 27 July 2023
ER -