TY - GEN
T1 - Policy-Dependent and Policy-Independent Static Reduction of Stochastic Dynamic Teams and Games and Fragility of Equivalence Properties
AU - Sanjari, Sina
AU - Basar, Tamer
AU - Yuksel, Serdar
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - In stochastic control, information structure arguments have been crucial for stochastic analysis. Such an approach is often called static reduction in dynamic team theory (or decentralized stochastic control) and has been an effective method for establishing existence and approximation results for optimal policies. In this paper, we classify such static reductions into three categories: (i) those that are policy-independent (introduced by Witsenhausen in [17]), (ii) those that are policy-dependent (introduced by Ho and Chu [7], [8] for partially nested dynamic teams), and (iii) static measurement with control-sharing reduction (where the measurements become static although control actions are shared according to the partially nested information structure). For these reductions, while there exist bijection relationships between glob-ally optimal solutions of dynamic teams and their reductions, in general there is no bijection for person-by-person optimal policies. We also establish a similar result (but not identical) concerning stationary solutions. We present sufficient conditions under which bijection relationships hold. Under static measurement with control-sharing reduction, connections between optimality concepts can be established under relaxed conditions. An implication is a convexity characterization of dynamic teams under static measurement with control-sharing reduction. Some counterparts for stochastic games are also discussed.
AB - In stochastic control, information structure arguments have been crucial for stochastic analysis. Such an approach is often called static reduction in dynamic team theory (or decentralized stochastic control) and has been an effective method for establishing existence and approximation results for optimal policies. In this paper, we classify such static reductions into three categories: (i) those that are policy-independent (introduced by Witsenhausen in [17]), (ii) those that are policy-dependent (introduced by Ho and Chu [7], [8] for partially nested dynamic teams), and (iii) static measurement with control-sharing reduction (where the measurements become static although control actions are shared according to the partially nested information structure). For these reductions, while there exist bijection relationships between glob-ally optimal solutions of dynamic teams and their reductions, in general there is no bijection for person-by-person optimal policies. We also establish a similar result (but not identical) concerning stationary solutions. We present sufficient conditions under which bijection relationships hold. Under static measurement with control-sharing reduction, connections between optimality concepts can be established under relaxed conditions. An implication is a convexity characterization of dynamic teams under static measurement with control-sharing reduction. Some counterparts for stochastic games are also discussed.
UR - http://www.scopus.com/inward/record.url?scp=85126052731&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126052731&partnerID=8YFLogxK
U2 - 10.1109/CDC45484.2021.9683260
DO - 10.1109/CDC45484.2021.9683260
M3 - Conference contribution
AN - SCOPUS:85126052731
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 6231
EP - 6236
BT - 60th IEEE Conference on Decision and Control, CDC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 60th IEEE Conference on Decision and Control, CDC 2021
Y2 - 13 December 2021 through 17 December 2021
ER -