TY - GEN
T1 - Nash Equilibrium Seeking with Arbitrarily Delayed Player Actions
AU - Oliveira, Tiago Roux
AU - Hugo Pereira Rodrigues, Victor
AU - Krstic, Miroslav
AU - Basar, Tamer
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/12/14
Y1 - 2020/12/14
N2 - In this paper, we propose a non-model based strategy for locally stable convergence to Nash equilibrium in a quadratic noncooperative (duopoly) game with arbitrarily delayed player actions. In our noncooperative scenario, the players have access only to their own payoff values, again with delay. The proposed approach is based on the extremum seeking perspective, which has previously been reported for real-time optimization problems by exploring sinusoidal perturbation signals to estimate the Gradient (first derivative) and Hessian (second derivative) of unknown locally quadratic functions. Indeed, this is the first contribution which considers extremum seeking for noncooperative games in the presence of delays. In order to compensate distinct delays in the inputs of the two players, we have employed boundary control via predictor feedback with averaging-based estimates. We apply a small-gain analysis for the resulting Input-to-State Stable hyperbolic PDEODE loop as well as averaging theory in infinite dimensions, due to the infinite-dimensional state of the time delays, in order to obtain local convergence results to a small neighborhood of the Nash equilibrium. We quantify the size of these residual sets and corroborate the theoretical results numerically on an example of a two-player game with delays.
AB - In this paper, we propose a non-model based strategy for locally stable convergence to Nash equilibrium in a quadratic noncooperative (duopoly) game with arbitrarily delayed player actions. In our noncooperative scenario, the players have access only to their own payoff values, again with delay. The proposed approach is based on the extremum seeking perspective, which has previously been reported for real-time optimization problems by exploring sinusoidal perturbation signals to estimate the Gradient (first derivative) and Hessian (second derivative) of unknown locally quadratic functions. Indeed, this is the first contribution which considers extremum seeking for noncooperative games in the presence of delays. In order to compensate distinct delays in the inputs of the two players, we have employed boundary control via predictor feedback with averaging-based estimates. We apply a small-gain analysis for the resulting Input-to-State Stable hyperbolic PDEODE loop as well as averaging theory in infinite dimensions, due to the infinite-dimensional state of the time delays, in order to obtain local convergence results to a small neighborhood of the Nash equilibrium. We quantify the size of these residual sets and corroborate the theoretical results numerically on an example of a two-player game with delays.
UR - http://www.scopus.com/inward/record.url?scp=85099884253&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099884253&partnerID=8YFLogxK
U2 - 10.1109/CDC42340.2020.9303894
DO - 10.1109/CDC42340.2020.9303894
M3 - Conference contribution
AN - SCOPUS:85099884253
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 150
EP - 155
BT - 2020 59th IEEE Conference on Decision and Control, CDC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 59th IEEE Conference on Decision and Control, CDC 2020
Y2 - 14 December 2020 through 18 December 2020
ER -