TY - JOUR
T1 - Stability of Gated Recurrent Unit Neural Networks
T2 - Convex Combination Formulation Approach
AU - Stipanović, Dušan M.
AU - Kapetina, Mirna N.
AU - Rapaić, Milan R.
AU - Murmann, Boris
N1 - Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2021/1
Y1 - 2021/1
N2 - In this paper, a particular discrete-time nonlinear and time-invariant system represented as a vector difference equation is analyzed for its stability properties. The motivation for analyzing this particular system is that it models gated recurrent unit neural networks commonly used and well known in machine learning applications. From the technical perspective, the analyses exploit the systems similarities to a convex combination of discrete-time systems, where one of the systems is trivial, and thus, the overall properties are mostly dependent on the other one. Stability results are formulated for the nonlinear system and its linearization with respect to the systems, in general, multiple equilibria. To motivate and illustrate the potential of these results in applications, some particular results are derived for the gated recurrent unit neural network models and a connection between local stability analysis and learning is provided.
AB - In this paper, a particular discrete-time nonlinear and time-invariant system represented as a vector difference equation is analyzed for its stability properties. The motivation for analyzing this particular system is that it models gated recurrent unit neural networks commonly used and well known in machine learning applications. From the technical perspective, the analyses exploit the systems similarities to a convex combination of discrete-time systems, where one of the systems is trivial, and thus, the overall properties are mostly dependent on the other one. Stability results are formulated for the nonlinear system and its linearization with respect to the systems, in general, multiple equilibria. To motivate and illustrate the potential of these results in applications, some particular results are derived for the gated recurrent unit neural network models and a connection between local stability analysis and learning is provided.
KW - Difference equations
KW - Neural networks
KW - Stability
UR - http://www.scopus.com/inward/record.url?scp=85095415489&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095415489&partnerID=8YFLogxK
U2 - 10.1007/s10957-020-01776-w
DO - 10.1007/s10957-020-01776-w
M3 - Article
AN - SCOPUS:85095415489
SN - 0022-3239
VL - 188
SP - 291
EP - 306
JO - Journal of Optimization Theory and Applications
JF - Journal of Optimization Theory and Applications
IS - 1
ER -