TY - GEN
T1 - Inverse optimal control for a hybrid dynamical system with impacts
AU - Aghasadeghi, Navid
AU - Long, Andrew
AU - Bretl, Timothy
PY - 2012
Y1 - 2012
N2 - In this paper, we develop an approach to inverse optimal control for a class of hybrid dynamical system with impacts. As it is usually posed, the problem of inverse optimal control is to find a cost function that is consistent with an observed sequence of decisions, under the assumption that these decisions are optimal. We assume instead that observed decisions are only approximately optimal and find a cost function that minimizes the extent to which these decisions violate first-order necessary conditions for optimality. For the hybrid dynamical system that we consider with a cost function that is a linear combination of known basis functions, this minimization is a convex program. In fact, it reduces to a simple least-squares computation that - unlike most other forms of inverse optimal control - can be solved very efficiently. We apply our approach to a dynamic bipedal climbing robot in simulation, showing that we can recover cost functions from observed trajectories that are consistent with two different modes of locomotion.
AB - In this paper, we develop an approach to inverse optimal control for a class of hybrid dynamical system with impacts. As it is usually posed, the problem of inverse optimal control is to find a cost function that is consistent with an observed sequence of decisions, under the assumption that these decisions are optimal. We assume instead that observed decisions are only approximately optimal and find a cost function that minimizes the extent to which these decisions violate first-order necessary conditions for optimality. For the hybrid dynamical system that we consider with a cost function that is a linear combination of known basis functions, this minimization is a convex program. In fact, it reduces to a simple least-squares computation that - unlike most other forms of inverse optimal control - can be solved very efficiently. We apply our approach to a dynamic bipedal climbing robot in simulation, showing that we can recover cost functions from observed trajectories that are consistent with two different modes of locomotion.
UR - http://www.scopus.com/inward/record.url?scp=84864460786&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864460786&partnerID=8YFLogxK
U2 - 10.1109/ICRA.2012.6225259
DO - 10.1109/ICRA.2012.6225259
M3 - Conference contribution
AN - SCOPUS:84864460786
SN - 9781467314039
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 4962
EP - 4967
BT - 2012 IEEE International Conference on Robotics and Automation, ICRA 2012
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2012 IEEE International Conference on Robotics and Automation, ICRA 2012
Y2 - 14 May 2012 through 18 May 2012
ER -