Nash equilibrium seeking for dynamic systems with non-quadratic payoffs

Paul Frihauf, Miroslav Krstic, Tamer Başar

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

We consider general, stable nonlinear differential equations with N inputs and N outputs, where in the steady state, the output signals represent the payoff functions of a noncooperative game played by the steady-state values of the input signals. To achieve locally stable convergence to the resulting steady-state Nash equilibria, we introduce a non-model-based approach, where the players determine their actions based only on their own payoff values. This strategy is based on the extremum seeking approach, which has previously been developed for standard optimization problems and employs sinusoidal perturbations to estimate the gradient. Since non-quadratic payoffs create the possibility of multiple, isolated Nash equilibria, our convergence results are local. Specifically, the attainment of any particular Nash equilibrium is not assured for all initial conditions, but only for initial conditions in a set around that specific stable Nash equilibrium. For non-quadratic costs, the convergence to a Nash equilibrium is not perfect, but is biased in proportion to the perturbation amplitudes and the higher derivatives of the payoff functions. We quantify the size of these residual biases.

Original languageEnglish (US)
Title of host publicationAnnals of the International Society of Dynamic Games
PublisherBirkhäuser
Pages179-198
Number of pages20
DOIs
StatePublished - 2013

Publication series

NameAnnals of the International Society of Dynamic Games
Volume12
ISSN (Print)2474-0179
ISSN (Electronic)2474-0187

Keywords

  • Extremum seeking
  • Learning
  • Nash equilibria
  • Noncooperative games

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Nash equilibrium seeking for dynamic systems with non-quadratic payoffs'. Together they form a unique fingerprint.

Cite this