TY - JOUR
T1 - DiffTune-MPC
T2 - Closed-Loop Learning for Model Predictive Control
AU - Tao, Ran
AU - Cheng, Sheng
AU - Wang, Xiaofeng
AU - Wang, Shenlong
AU - Hovakimyan, Naira
N1 - This paper was recommended for publication by Editor Jens Kober upon evaluation of the Associate Editor and Reviewers\u2019 comments. This work is supported by NASA cooperative agreement (80NSSC22M0070), NASA ULI (80NSSC22M0070), NASA USRC (NNH21ZEA001N-USRC), AFOSR (FA9550-21-1-0411), NSF-AoF Robust Intelligence (2133656), and NSF SLES (2331878, 2331879). \u2020These authors contributed equally to this work.
PY - 2024
Y1 - 2024
N2 - Model predictive control (MPC) has been applied to many platforms in robotics and autonomous systems for its capability to predict a system's future behavior while incorporating constraints that a system may have. To enhance the performance of a system with an MPC controller, one can manually tune the MPC's cost function. However, it can be challenging due to the possibly high dimension of the parameter space as well as the potential difference between the open-loop cost function in MPC and the overall closed-loop performance metric function. This letter presents DiffTune-MPC, a novel learning method, to learn the cost function of an MPC in a closed-loop manner. The proposed framework is compatible with the scenario where the time interval for performance evaluation and MPC's planning horizon have different lengths. We show the auxiliary problem whose solution admits the analytical gradients of MPC and discuss its variations in different MPC settings, including nonlinear MPCs that are solved using sequential quadratic programming. Simulation results demonstrate the learning capability of DiffTune-MPC and the generalization capability of the learned MPC parameters.
AB - Model predictive control (MPC) has been applied to many platforms in robotics and autonomous systems for its capability to predict a system's future behavior while incorporating constraints that a system may have. To enhance the performance of a system with an MPC controller, one can manually tune the MPC's cost function. However, it can be challenging due to the possibly high dimension of the parameter space as well as the potential difference between the open-loop cost function in MPC and the overall closed-loop performance metric function. This letter presents DiffTune-MPC, a novel learning method, to learn the cost function of an MPC in a closed-loop manner. The proposed framework is compatible with the scenario where the time interval for performance evaluation and MPC's planning horizon have different lengths. We show the auxiliary problem whose solution admits the analytical gradients of MPC and discuss its variations in different MPC settings, including nonlinear MPCs that are solved using sequential quadratic programming. Simulation results demonstrate the learning capability of DiffTune-MPC and the generalization capability of the learned MPC parameters.
KW - Learning for control
KW - model predictive control
KW - parameter optimization
UR - http://www.scopus.com/inward/record.url?scp=85197508349&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85197508349&partnerID=8YFLogxK
U2 - 10.1109/LRA.2024.3422836
DO - 10.1109/LRA.2024.3422836
M3 - Article
AN - SCOPUS:85197508349
SN - 2377-3766
VL - 9
SP - 7294
EP - 7301
JO - IEEE Robotics and Automation Letters
JF - IEEE Robotics and Automation Letters
IS - 8
ER -