TY - JOUR
T1 - Risk-Sensitive Zero-Sum Differential Games
AU - Moon, Jun
AU - Duncan, Tyrone E.
AU - Basar, Tamer
N1 - Funding Information:
Manuscript received December 6, 2017; revised March 28, 2018; accepted May 18, 2018. Date of publication June 11, 2018; date of current version March 27, 2019. This work was supported in part by the National Research Foundation of Korea under Grant NRF-2016R1C1B1015474, Grant NRF-2017R1E1A1A03070936, and Grant NRF-2017R1A5A1015311, funded by the Ministry of Science and ICT; in part by the NSF grant DMS 1411412, ARO grant W911NF-14-10390, AFOSR grant FA9550-17-1-0073, and a Simons Fellowship; and in part by the Office of Naval Research (ONR) MURI under Grant N00014-16-1-2710. Recommended by Associate Editor R. P. Malhame. (Corresponding author: Jun Moon.) J. Moon is with the School of Electrical and Computer Engineering, University of Seoul, Seoul 02504, South Korea (e-mail: jmoon12@ uos.ac.kr).
Publisher Copyright:
© 1963-2012 IEEE.
PY - 2019/4
Y1 - 2019/4
N2 - We consider two-player risk-sensitive zero-sum differential games (RSZSDGs). In our problem setup, both the drift term and the diffusion term in the controlled stochastic differential equation are dependent on the state and controls of both players, and the objective functional is of the risk-sensitive type. First, a stochastic maximum principle type necessary condition for an open-loop saddle point of the RSZSDG is established via nonlinear transformations of the adjoint processes of the equivalent risk-neutral stochastic zero-sum differential game. In particular, we obtain two variational inequalities, namely, the pair of saddle-point inequalities of the RSZSDG. Next, we obtain the Hamilton-Jacobi-Isaacs partial differential equation for the RSZSDG, which provides a sufficient condition for a feedback saddle point of the RSZSDG, using a logarithmic transformation of the associated value function. Finally, we study the extended linear-quadratic RSZSDG (LQ-RSZSDG). We show intractability of the extended LQ-RSZSDG with the state and/or controls of both players appearing in the diffusion term. This unexpected intractability could lead to nonlinear open-loop and feedback saddle points even if the problem itself is essentially LQ and the Isaacs condition holds.
AB - We consider two-player risk-sensitive zero-sum differential games (RSZSDGs). In our problem setup, both the drift term and the diffusion term in the controlled stochastic differential equation are dependent on the state and controls of both players, and the objective functional is of the risk-sensitive type. First, a stochastic maximum principle type necessary condition for an open-loop saddle point of the RSZSDG is established via nonlinear transformations of the adjoint processes of the equivalent risk-neutral stochastic zero-sum differential game. In particular, we obtain two variational inequalities, namely, the pair of saddle-point inequalities of the RSZSDG. Next, we obtain the Hamilton-Jacobi-Isaacs partial differential equation for the RSZSDG, which provides a sufficient condition for a feedback saddle point of the RSZSDG, using a logarithmic transformation of the associated value function. Finally, we study the extended linear-quadratic RSZSDG (LQ-RSZSDG). We show intractability of the extended LQ-RSZSDG with the state and/or controls of both players appearing in the diffusion term. This unexpected intractability could lead to nonlinear open-loop and feedback saddle points even if the problem itself is essentially LQ and the Isaacs condition holds.
KW - Hamilton-Jacobi-Isaacs (HJI) equation
KW - linear-quadratic stochastic differential games
KW - risk-sensitive games
KW - stochastic differential games (SDGs)
KW - stochastic maximum principle (SMP)
UR - http://www.scopus.com/inward/record.url?scp=85048492788&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85048492788&partnerID=8YFLogxK
U2 - 10.1109/TAC.2018.2846048
DO - 10.1109/TAC.2018.2846048
M3 - Article
AN - SCOPUS:85048492788
VL - 64
SP - 1503
EP - 1518
JO - IEEE Transactions on Automatic Control
JF - IEEE Transactions on Automatic Control
SN - 0018-9286
IS - 4
M1 - 8378056
ER -