TY - GEN
T1 - DeepPicar
T2 - 24th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2018
AU - Bechtel, Michael G.
AU - McEllhiney, Elise
AU - Kim, Minje
AU - Yun, Heechul
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2019/1/9
Y1 - 2019/1/9
N2 - We present DeepPicar, a low-cost deep neural network based autonomous car platform. DeepPicar is a small scale replication of a real self-driving car called DAVE-2 by NVIDIA. DAVE-2 uses a deep convolutional neural network (CNN), which takes images from a front-facing camera as input and produces car steering angles as output. DeepPicar uses the same network architecture - 9 layers, 27 million connections and 250K parameters - and can drive itself in real-time using a web camera and a Raspberry Pi 3 quad-core platform. Using DeepPicar, we analyze the Pi 3's computing capabilities to support end-to-end deep learning based real-time control of autonomous vehicles. We also systematically compare other contemporary embedded computing platforms using the DeepPicar's CNN-based real-time control workload. We find that all tested platforms, including the Pi 3, are capable of supporting the CNN-based real-time control, from 20 Hz up to 100 Hz, depending on hardware platform. However, we find that shared resource contention remains an important issue that must be considered in applying CNN models on shared memory based embedded computing platforms; we observe up to 11.6X execution time increase in the CNN based control loop due to shared resource contention. To protect the CNN workload, we also evaluate state-of-the-art cache partitioning and memory bandwidth throttling techniques on the Pi 3. We find that cache partitioning is ineffective, while memory bandwidth throttling is an effective solution.
AB - We present DeepPicar, a low-cost deep neural network based autonomous car platform. DeepPicar is a small scale replication of a real self-driving car called DAVE-2 by NVIDIA. DAVE-2 uses a deep convolutional neural network (CNN), which takes images from a front-facing camera as input and produces car steering angles as output. DeepPicar uses the same network architecture - 9 layers, 27 million connections and 250K parameters - and can drive itself in real-time using a web camera and a Raspberry Pi 3 quad-core platform. Using DeepPicar, we analyze the Pi 3's computing capabilities to support end-to-end deep learning based real-time control of autonomous vehicles. We also systematically compare other contemporary embedded computing platforms using the DeepPicar's CNN-based real-time control workload. We find that all tested platforms, including the Pi 3, are capable of supporting the CNN-based real-time control, from 20 Hz up to 100 Hz, depending on hardware platform. However, we find that shared resource contention remains an important issue that must be considered in applying CNN models on shared memory based embedded computing platforms; we observe up to 11.6X execution time increase in the CNN based control loop due to shared resource contention. To protect the CNN workload, we also evaluate state-of-the-art cache partitioning and memory bandwidth throttling techniques on the Pi 3. We find that cache partitioning is ineffective, while memory bandwidth throttling is an effective solution.
KW - Autonomous car
KW - Case study
KW - Convolutional neural network
KW - Real-time
UR - http://www.scopus.com/inward/record.url?scp=85061825698&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061825698&partnerID=8YFLogxK
U2 - 10.1109/RTCSA.2018.00011
DO - 10.1109/RTCSA.2018.00011
M3 - Conference contribution
AN - SCOPUS:85061825698
T3 - Proceedings - 2018 IEEE 24th International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2018
SP - 11
EP - 21
BT - Proceedings - 2018 IEEE 24th International Conference on Embedded and Real-Time Computing Systems and Applications, RTCSA 2018
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 29 August 2018 through 31 August 2018
ER -