TY - JOUR
T1 - DiffTune
T2 - Autotuning Through Autodifferentiation
AU - Cheng, Sheng
AU - Kim, Minkyung
AU - Song, Lin
AU - Yang, Chengyu
AU - Jin, Yiquan
AU - Wang, Shenlong
AU - Hovakimyan, Naira
N1 - *This work is supported by NASA under the cooperative agreement 80NSSC20M0229, NSF-AoF Robust Intelligence (2133656), NSF SLES (2331878), Air Force Office of Scientific Research (AFOSR) grant FA9550-21-1-0411, Amazon Research Award, and Illinois-Insper Collaborative Research Fund. \u2217These authors contributed equally to this work. S. Cheng, M. Kim, L. Song, C. Yang, Y. Jin, and N. Hovakimyan are with the Department of Mechanical Science and Engineering, and S. Wang is with the Department of Computer Science. All authors are with the University of Illinois Urbana-Champaign, USA. (email: {chengs,mk58,linsong2,cy45,yiquanj2,shenlong, nhovakim}@illinois.edu)
This work was supported in part by NASA under the cooperative agreement 80NSSC20M0229, in part by National Science Foundation (NSF)-AoF Robust Intelligence under Grant 2133656, in part by NSF SLES under Grant 2331878, in part by Air Force Office of Scientific Research (AFOSR) under Grant FA9550-21-1-0411, in part by Amazon Research Award, and in part by Illinois-Insper Collaborative Research Fund.
PY - 2024
Y1 - 2024
N2 - The performance of robots in high-level tasks depends on the quality of their lower level controller, which requires fine-tuning. However, the intrinsically nonlinear dynamics and controllers make tuning a challenging task when it is done by hand. In this article, we present DiffTune, a novel, gradient-based automatic tuning framework. We formulate the controller tuning as a parameter optimization problem. Our method unrolls the dynamical system and controller as a computational graph and updates the controller parameters through gradient-based optimization. The gradient is obtained using sensitivity propagation, which is the only method for gradient computation when tuning for a physical system instead of its simulated counterpart. Furthermore, we use L
1 adaptive control to compensate for the uncertainties (that unavoidably exist in a physical system) such that the gradient is not biased by the unmodeled uncertainties. We validate the DiffTune on a Dubin's car and a quadrotor in challenging simulation environments. In comparison with state-of-the-art autotuning methods, DiffTune achieves the best performance in a more efficient manner owing to its effective usage of the first-order information of the system. Experiments on tuning a nonlinear controller for quadrotor show promising results, where DiffTune achieves 3.5× tracking error reduction on an aggressive trajectory in only ten trials over a 12-D controller parameter space.
AB - The performance of robots in high-level tasks depends on the quality of their lower level controller, which requires fine-tuning. However, the intrinsically nonlinear dynamics and controllers make tuning a challenging task when it is done by hand. In this article, we present DiffTune, a novel, gradient-based automatic tuning framework. We formulate the controller tuning as a parameter optimization problem. Our method unrolls the dynamical system and controller as a computational graph and updates the controller parameters through gradient-based optimization. The gradient is obtained using sensitivity propagation, which is the only method for gradient computation when tuning for a physical system instead of its simulated counterpart. Furthermore, we use L
1 adaptive control to compensate for the uncertainties (that unavoidably exist in a physical system) such that the gradient is not biased by the unmodeled uncertainties. We validate the DiffTune on a Dubin's car and a quadrotor in challenging simulation environments. In comparison with state-of-the-art autotuning methods, DiffTune achieves the best performance in a more efficient manner owing to its effective usage of the first-order information of the system. Experiments on tuning a nonlinear controller for quadrotor show promising results, where DiffTune achieves 3.5× tracking error reduction on an aggressive trajectory in only ten trials over a 12-D controller parameter space.
KW - Aerial systems: Mechanics and control
KW - controller auto-tuning
KW - learning and adaptive systems
UR - http://www.scopus.com/inward/record.url?scp=85199054845&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85199054845&partnerID=8YFLogxK
U2 - 10.1109/TRO.2024.3429191
DO - 10.1109/TRO.2024.3429191
M3 - Article
AN - SCOPUS:85199054845
SN - 1552-3098
VL - 40
SP - 4085
EP - 4101
JO - IEEE Transactions on Robotics
JF - IEEE Transactions on Robotics
ER -