TY - GEN
T1 - Underprovisioned GPUs
T2 - 32nd International Conference on Computer Communications and Networks, ICCCN 2023
AU - Hu, Yigong
AU - Gokarn, Ila
AU - Liu, Shengzhong
AU - Misra, Archan
AU - Abdelzaher, Tarek
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Recent work suggests that computing resources, such as GPUs in real-time edge-based perception systems, need not have sufficient capacity to keep up with the input frame rates of all input devices (e.g., cameras) at their full-frame resolution. Rather, they can be under-provisioned because only parts of any given frame need to be inspected (i.e., paid attention to). This paper derives an attention allocation policy, called canvas-based attention scheduling that decides which parts of each frame of each device to inspect, and a corresponding schedulability condition that relates the spatiotemporal properties of surrounding objects to the ability of the edge-based perception subsystem to keep up with the state of the environment in real-time. It provides a quantitative estimate of adequate computing capacity for the expected perception workload. We implement a canvas-based attention scheduler for an object detection application and perform an empirical comparative study based on actual GPU hardware and surveillance videos. Results show that canvas-based attention scheduling keeps up with the environment while using a much smaller GPU capacity, compared with prior approaches.
AB - Recent work suggests that computing resources, such as GPUs in real-time edge-based perception systems, need not have sufficient capacity to keep up with the input frame rates of all input devices (e.g., cameras) at their full-frame resolution. Rather, they can be under-provisioned because only parts of any given frame need to be inspected (i.e., paid attention to). This paper derives an attention allocation policy, called canvas-based attention scheduling that decides which parts of each frame of each device to inspect, and a corresponding schedulability condition that relates the spatiotemporal properties of surrounding objects to the ability of the edge-based perception subsystem to keep up with the state of the environment in real-time. It provides a quantitative estimate of adequate computing capacity for the expected perception workload. We implement a canvas-based attention scheduler for an object detection application and perform an empirical comparative study based on actual GPU hardware and surveillance videos. Results show that canvas-based attention scheduling keeps up with the environment while using a much smaller GPU capacity, compared with prior approaches.
UR - http://www.scopus.com/inward/record.url?scp=85173584866&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85173584866&partnerID=8YFLogxK
U2 - 10.1109/ICCCN58024.2023.10230127
DO - 10.1109/ICCCN58024.2023.10230127
M3 - Conference contribution
AN - SCOPUS:85173584866
T3 - Proceedings - International Conference on Computer Communications and Networks, ICCCN
BT - ICCCN 2023 - 2023 32nd International Conference on Computer Communications and Networks
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 24 July 2023 through 27 July 2023
ER -