TY - JOUR
T1 - Stackelberg Strategies and Incentives in Multiperson Deterministic Decision Problems
AU - Zheng, Ying Ping
AU - Basar, Tamer
AU - Cruz, Jose B.
PY - 1984
Y1 - 1984
N2 - In this paper discrete and continuous-time two-person decision problems with a hierarchical decision structure are studied and applicability and appropriateness of a function—space approach in the derivation of causal real-time implementable optimal Stackelberg (incentive) strategies under various information patterns are discussed. Results on existence and derivation of incentive strategies for dynamic games formulated in abstract inner-product spaces, in the absence of any causality restriction on the leader's policies, are first presented and then these results are extended (and specialized) in two major directions: 1) discrete-time dynamic games with informational advantage to the leader at each stage of the decision process, which involves partial observation of the follower's decisions; and derivation of multistage incentive strategies for the leader under a feedback Stackelberg solution adapted to the feedback information pattern; and 2) derivation of causal, physically realizable optimum affine Stackelberg policies for both discrete and continuous-time problems, in terms of the gradients of the cost functionals evaluated at the optimum (achievable) operating point (which is in some cases the globally minimizing solution of the leader's cost functional). The paper is concluded with some applications of the theory to important special cases, some extensions to infinite-horizon problems, and some numerical examples that further illustrate these results.
AB - In this paper discrete and continuous-time two-person decision problems with a hierarchical decision structure are studied and applicability and appropriateness of a function—space approach in the derivation of causal real-time implementable optimal Stackelberg (incentive) strategies under various information patterns are discussed. Results on existence and derivation of incentive strategies for dynamic games formulated in abstract inner-product spaces, in the absence of any causality restriction on the leader's policies, are first presented and then these results are extended (and specialized) in two major directions: 1) discrete-time dynamic games with informational advantage to the leader at each stage of the decision process, which involves partial observation of the follower's decisions; and derivation of multistage incentive strategies for the leader under a feedback Stackelberg solution adapted to the feedback information pattern; and 2) derivation of causal, physically realizable optimum affine Stackelberg policies for both discrete and continuous-time problems, in terms of the gradients of the cost functionals evaluated at the optimum (achievable) operating point (which is in some cases the globally minimizing solution of the leader's cost functional). The paper is concluded with some applications of the theory to important special cases, some extensions to infinite-horizon problems, and some numerical examples that further illustrate these results.
UR - http://www.scopus.com/inward/record.url?scp=0021310405&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0021310405&partnerID=8YFLogxK
U2 - 10.1109/TSMC.1984.6313265
DO - 10.1109/TSMC.1984.6313265
M3 - Article
AN - SCOPUS:0021310405
SN - 0018-9472
VL - SMC-14
SP - 10
EP - 24
JO - IEEE Transactions on Systems, Man and Cybernetics
JF - IEEE Transactions on Systems, Man and Cybernetics
IS - 1
ER -