Abstract
Adaptively controlling and minimizing regret in unknown dynamical systems while controlling the growth of the system state is crucial in real-world applications. In this work, we study the problem of stabilization and regret minimization of linear over-actuated dynamical systems. We propose an optimism-based algorithm that leverages possibility of switching between actuating modes in order to alleviate state explosion during initial time steps. We theoretically study the rate at which our algorithm learns a stabilizing controller and prove that it achieves a regret upper bound of O(√T).
Original language | English (US) |
---|---|
Pages (from-to) | 79-84 |
Number of pages | 6 |
Journal | IFAC-PapersOnLine |
Volume | 55 |
Issue number | 25 |
DOIs | |
State | Published - 2022 |
Externally published | Yes |
Event | 10th IFAC Symposium on Robust Control Design, ROCOND 2022 - Kyoto, Japan Duration: Aug 30 2022 → Sep 2 2022 |
Keywords
- Actuator Redundancy
- Adaptive Control
- Model-Based Reinforcement Learning
- Regret Bound
- stabilization
ASJC Scopus subject areas
- Control and Systems Engineering