TY - JOUR
T1 - Batch-to-Batch Finite-Horizon LQ Control for Unknown Discrete-Time Linear Systems Via Stochastic Extremum Seeking
AU - Liu, Shu Jun
AU - Krstic, Miroslav
AU - Basar, Tamer
N1 - Funding Information:
Manuscript received June 1, 2016; revised September 29, 2016; accepted October 18, 2016. Date of publication October 21, 2016; date of current version July 26, 2017. The work was supported by National Natural Science Foundation of China (No. 61322311, 61673284). Recommended by Associate Editor M. Alamir.
Publisher Copyright:
© 2017 IEEE.
PY - 2017/8
Y1 - 2017/8
N2 - We employ our recent discrete-time stochastic averaging theorems and stochastic extremum seeking to iteratively (batch-to-batch) optimize open-loop control sequences for unknown but reachable discrete-time linear systems with a scalar input and without known system dimension, for a cost that is quadratic in the measurable output and the input. First, for a multivariable gradient-based stochastic extremum seeking algorithm we prove local exponential convergence to the optimal open-loop control sequence. Second, to remove the convergence rate's dependence on the Hessian matrix of the cost function, which is unknown since the system's model (the system matrices $(A,B,C)$) is unknown, we develop a multivariable discrete-time Newton-based stochastic extremum seeking method, design the Newton-based algorithm for the iteration of the input sequence, and prove local exponential convergence to the optimal input sequence. Finally, two simulation examples are given to illustrate the effectiveness of the two methods.
AB - We employ our recent discrete-time stochastic averaging theorems and stochastic extremum seeking to iteratively (batch-to-batch) optimize open-loop control sequences for unknown but reachable discrete-time linear systems with a scalar input and without known system dimension, for a cost that is quadratic in the measurable output and the input. First, for a multivariable gradient-based stochastic extremum seeking algorithm we prove local exponential convergence to the optimal open-loop control sequence. Second, to remove the convergence rate's dependence on the Hessian matrix of the cost function, which is unknown since the system's model (the system matrices $(A,B,C)$) is unknown, we develop a multivariable discrete-time Newton-based stochastic extremum seeking method, design the Newton-based algorithm for the iteration of the input sequence, and prove local exponential convergence to the optimal input sequence. Finally, two simulation examples are given to illustrate the effectiveness of the two methods.
KW - LQ control
KW - stochastic averaging
KW - stochastic extremum seeking
UR - http://www.scopus.com/inward/record.url?scp=85029316480&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85029316480&partnerID=8YFLogxK
U2 - 10.1109/TAC.2016.2620379
DO - 10.1109/TAC.2016.2620379
M3 - Article
AN - SCOPUS:85029316480
SN - 0018-9286
VL - 62
SP - 4116
EP - 4123
JO - IRE Transactions on Automatic Control
JF - IRE Transactions on Automatic Control
IS - 8
M1 - 7605544
ER -