TY - GEN
T1 - A convex approach to inverse optimal control and its application to modeling human locomotion
AU - Puydupin-Jamin, Anne Sophie
AU - Johnson, Miles
AU - Bretl, Timothy Wolfe
PY - 2012
Y1 - 2012
N2 - Inverse optimal control is the problem of computing a cost function that would have resulted in an observed sequence of decisions. The standard formulation of this problem assumes that decisions are optimal and tries to minimize the difference between what was observed and what would have been observed given a candidate cost function. We assume instead that decisions are only approximately optimal and try to minimize the extent to which observed decisions violate first-order necessary conditions for optimality. For a discrete-time optimal control system with a cost function that is a linear combination of known basis functions, this formulation leads to an efficient method of solution as an unconstrained least-squares problem. We apply this approach to both simulated and experimental data to obtain a simple model of human walking trajectories. This model might subsequently be used either for control of a humanoid robot or for predicting human motion when moving a robot through crowded areas.
AB - Inverse optimal control is the problem of computing a cost function that would have resulted in an observed sequence of decisions. The standard formulation of this problem assumes that decisions are optimal and tries to minimize the difference between what was observed and what would have been observed given a candidate cost function. We assume instead that decisions are only approximately optimal and try to minimize the extent to which observed decisions violate first-order necessary conditions for optimality. For a discrete-time optimal control system with a cost function that is a linear combination of known basis functions, this formulation leads to an efficient method of solution as an unconstrained least-squares problem. We apply this approach to both simulated and experimental data to obtain a simple model of human walking trajectories. This model might subsequently be used either for control of a humanoid robot or for predicting human motion when moving a robot through crowded areas.
UR - http://www.scopus.com/inward/record.url?scp=84864439219&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864439219&partnerID=8YFLogxK
U2 - 10.1109/ICRA.2012.6225317
DO - 10.1109/ICRA.2012.6225317
M3 - Conference contribution
AN - SCOPUS:84864439219
SN - 9781467314039
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 531
EP - 536
BT - 2012 IEEE International Conference on Robotics and Automation, ICRA 2012
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2012 IEEE International Conference on Robotics and Automation, ICRA 2012
Y2 - 14 May 2012 through 18 May 2012
ER -