TY - JOUR
T1 - Relaxation techniques and asynchronous algorithms for on-line computation of non-cooperative equilibria
AU - Başar, Tamer
N1 - Funding Information:
*This work was supported in part by the Air Force Office of Scientific Research under Grant No. AFOSR 084-0056.
PY - 1987/12
Y1 - 1987/12
N2 - The equilibrium solution of a zero-sum or a non-zero-sum game is said to be stable if, after any deviation from that equilibrium, an adjustment process that involves unilateral optimal responses by the players can bring it back to the starting point. One appealing feature of a stable equilibrium is that in the on-line adjustment process the players need to know only their own cost functions and the most recently computed (and broadcast) policies of the other players, and not the other players' cost functions. It is needless to say that not all equilibrium (saddle-point or Nash) solutions are stable, and hence the question arises whether there exists a different on-line (real-time implementable) computational algorithm which would converge to an equilibrium even if that equilibrium is not stable. In this paper, we address precisely this question, and introduce a relaxation technique which leads to on-line implementable algorithms that converge to equilibria, be they stable or unstable, and in some cases in a finite number of steps. We also obtain conditions for convergence of asynchronous algorithms which arise in the computation of equilibria in games where the order of responses in not fixed a priori. The discussion and the analyses are primarily confined to two-person deterministic games, with extensions to N-player games and stochastic games briefly mentioned and left as topics for future research.
AB - The equilibrium solution of a zero-sum or a non-zero-sum game is said to be stable if, after any deviation from that equilibrium, an adjustment process that involves unilateral optimal responses by the players can bring it back to the starting point. One appealing feature of a stable equilibrium is that in the on-line adjustment process the players need to know only their own cost functions and the most recently computed (and broadcast) policies of the other players, and not the other players' cost functions. It is needless to say that not all equilibrium (saddle-point or Nash) solutions are stable, and hence the question arises whether there exists a different on-line (real-time implementable) computational algorithm which would converge to an equilibrium even if that equilibrium is not stable. In this paper, we address precisely this question, and introduce a relaxation technique which leads to on-line implementable algorithms that converge to equilibria, be they stable or unstable, and in some cases in a finite number of steps. We also obtain conditions for convergence of asynchronous algorithms which arise in the computation of equilibria in games where the order of responses in not fixed a priori. The discussion and the analyses are primarily confined to two-person deterministic games, with extensions to N-player games and stochastic games briefly mentioned and left as topics for future research.
UR - http://www.scopus.com/inward/record.url?scp=38249035395&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38249035395&partnerID=8YFLogxK
U2 - 10.1016/S0165-1889(87)80006-4
DO - 10.1016/S0165-1889(87)80006-4
M3 - Article
AN - SCOPUS:38249035395
SN - 0165-1889
VL - 11
SP - 531
EP - 549
JO - Journal of Economic Dynamics and Control
JF - Journal of Economic Dynamics and Control
IS - 4
ER -