TY - GEN
T1 - Nash Equilibrium Seeking with Players Acting through Heat PDE Dynamics
AU - Oliveira, Tiago Roux
AU - Rodrigues, Victor Hugo Pereira
AU - Krstic, Miroslav
AU - Basar, Tamer
N1 - This work was supported in part by the Brazilian Funding Agencies CAPES (Finance Code 001), CNPq and FAPERJ. 1-Tiago Roux Oliveira is with State University of Rio de Janeiro (UERJ), Rio de Janeiro – RJ, Brazil. Email: [email protected] 2-Victor Hugo Pereira Rodrigues is with Federal University of Rio de Janeiro (UFRJ/COPPE), Rio de Janeiro – RJ, Brazil. Email: [email protected] 3-Miroslav Krstić is with University of California at San Diego (UCSD), San Diego – CA, USA. Email: [email protected] 4-Tamer Bas¸ar is with University of Illinois at Urbana-Champaign, Urbana – IL, USA. Email: [email protected]
PY - 2021/5/25
Y1 - 2021/5/25
N2 - We propose a non-model based strategy for locally stable convergence to Nash equilibria in quadratic noncooperative (duopoly) games with player actions subject to diffusion (heat) PDE dynamics with distinct diffusion coefficients and each player having access only to his own payoff value. The proposed approach employs extremum seeking, with sinusoidal perturbation signals employed to estimate the Gradient (first derivative) and Hessian (second derivative) of unknown quadratic functions. In our previous work, we solved Nash equilibrium seeking problems with input delays. This is the first instance of noncooperative games being tackled in a model-free fashion in the presence of heat PDE dynamics. In order to compensate distinct diffusion processes in the inputs of the two players, we employ boundary control with averaging-based estimates. We apply a small-gain analysis for the resulting Input-to-State Stable (ISS) parabolic PDE-ODE loop as well as averaging theory in infinite dimensions, due to the infinite-dimensional state of the heat PDEs, in order to obtain local convergence results to a small neighborhood of the Nash equilibrium. We quantify the size of these residual sets and illustrate the theoretical results numerically on an example of a two-player game under heat PDEs.
AB - We propose a non-model based strategy for locally stable convergence to Nash equilibria in quadratic noncooperative (duopoly) games with player actions subject to diffusion (heat) PDE dynamics with distinct diffusion coefficients and each player having access only to his own payoff value. The proposed approach employs extremum seeking, with sinusoidal perturbation signals employed to estimate the Gradient (first derivative) and Hessian (second derivative) of unknown quadratic functions. In our previous work, we solved Nash equilibrium seeking problems with input delays. This is the first instance of noncooperative games being tackled in a model-free fashion in the presence of heat PDE dynamics. In order to compensate distinct diffusion processes in the inputs of the two players, we employ boundary control with averaging-based estimates. We apply a small-gain analysis for the resulting Input-to-State Stable (ISS) parabolic PDE-ODE loop as well as averaging theory in infinite dimensions, due to the infinite-dimensional state of the heat PDEs, in order to obtain local convergence results to a small neighborhood of the Nash equilibrium. We quantify the size of these residual sets and illustrate the theoretical results numerically on an example of a two-player game under heat PDEs.
UR - http://www.scopus.com/inward/record.url?scp=85111910329&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85111910329&partnerID=8YFLogxK
U2 - 10.23919/ACC50511.2021.9483114
DO - 10.23919/ACC50511.2021.9483114
M3 - Conference contribution
AN - SCOPUS:85111910329
T3 - Proceedings of the American Control Conference
SP - 684
EP - 689
BT - 2021 American Control Conference, ACC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 American Control Conference, ACC 2021
Y2 - 25 May 2021 through 28 May 2021
ER -