TY - CHAP
T1 - Nash equilibrium seeking for dynamic systems with non-quadratic payoffs
AU - Frihauf, Paul
AU - Krstic, Miroslav
AU - Başar, Tamer
N1 - Publisher Copyright:
© Springer Science+Business Media New York 2013.
PY - 2013
Y1 - 2013
N2 - We consider general, stable nonlinear differential equations with N inputs and N outputs, where in the steady state, the output signals represent the payoff functions of a noncooperative game played by the steady-state values of the input signals. To achieve locally stable convergence to the resulting steady-state Nash equilibria, we introduce a non-model-based approach, where the players determine their actions based only on their own payoff values. This strategy is based on the extremum seeking approach, which has previously been developed for standard optimization problems and employs sinusoidal perturbations to estimate the gradient. Since non-quadratic payoffs create the possibility of multiple, isolated Nash equilibria, our convergence results are local. Specifically, the attainment of any particular Nash equilibrium is not assured for all initial conditions, but only for initial conditions in a set around that specific stable Nash equilibrium. For non-quadratic costs, the convergence to a Nash equilibrium is not perfect, but is biased in proportion to the perturbation amplitudes and the higher derivatives of the payoff functions. We quantify the size of these residual biases.
AB - We consider general, stable nonlinear differential equations with N inputs and N outputs, where in the steady state, the output signals represent the payoff functions of a noncooperative game played by the steady-state values of the input signals. To achieve locally stable convergence to the resulting steady-state Nash equilibria, we introduce a non-model-based approach, where the players determine their actions based only on their own payoff values. This strategy is based on the extremum seeking approach, which has previously been developed for standard optimization problems and employs sinusoidal perturbations to estimate the gradient. Since non-quadratic payoffs create the possibility of multiple, isolated Nash equilibria, our convergence results are local. Specifically, the attainment of any particular Nash equilibrium is not assured for all initial conditions, but only for initial conditions in a set around that specific stable Nash equilibrium. For non-quadratic costs, the convergence to a Nash equilibrium is not perfect, but is biased in proportion to the perturbation amplitudes and the higher derivatives of the payoff functions. We quantify the size of these residual biases.
KW - Extremum seeking
KW - Learning
KW - Nash equilibria
KW - Noncooperative games
UR - http://www.scopus.com/inward/record.url?scp=85055051229&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85055051229&partnerID=8YFLogxK
U2 - 10.1007/978-0-8176-8355-9_9
DO - 10.1007/978-0-8176-8355-9_9
M3 - Chapter
AN - SCOPUS:85055051229
T3 - Annals of the International Society of Dynamic Games
SP - 179
EP - 198
BT - Annals of the International Society of Dynamic Games
PB - Birkhäuser
ER -