Toward a theoretical foundation of policy optimization for learning control policies

B Hu, K Zhang, N Li, M Mesbahi… - Annual Review of …, 2023 - annualreviews.org
Gradient-based methods have been widely used for system design and optimization in
diverse application domains. Recently, there has been a renewed interest in studying …

Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems

Q Wei, T Li - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
For discrete-time nonlinear systems, this research is concerned with optimal control
problems (OCPs) with constrained cost, and a novel value iteration with constrained cost …

Global convergence of policy gradient primal–dual methods for risk-constrained LQRs

F Zhao, K You, T Başar - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org
While the techniques in optimal control theory are often model-based, the policy optimization
(PO) approach directly optimizes the performance metric of interest. Even though it has been …

Reinforcement learning with fast stabilization in linear dynamical systems

S Lale, K Azizzadenesheli, B Hassibi… - International …, 2022 - proceedings.mlr.press
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable
linear dynamical systems. When learning a dynamical system, one needs to stabilize the …

Augmented rbmle-ucb approach for adaptive control of linear quadratic systems

A Mete, R Singh, PR Kumar - Advances in Neural …, 2022 - proceedings.neurips.cc
We consider the problem of controlling an unknown stochastic linear system with quadratic
costs--called the adaptive LQ control problem. We re-examine an approach called``Reward …

Primal-dual learning for the model-free risk-constrained linear quadratic regulator

F Zhao, K You - Learning for Dynamics and Control, 2021 - proceedings.mlr.press
Risk-aware control, though with promise to tackle unexpected events, requires a known
exact dynamical model. In this work, we propose a model-free framework to learn a risk …

Linear quadratic control with risk constraints

A Tsiamis, DS Kalogerias, A Ribeiro… - arxiv preprint arxiv …, 2021 - arxiv.org
We propose a new risk-constrained formulation of the classical Linear Quadratic (LQ)
stochastic control problem for general partially-observed systems. Our framework is …

Online system identification and control for linear systems with multiagent controllers over wireless interference channels

M Tang, S Cai, VKN Lau - IEEE Transactions on Automatic …, 2022 - ieeexplore.ieee.org
In this article, we consider identification and control for potentially unstable linear systems
with multiagent controllers in the presence of wireless interference channels among the …

Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs

S Rozada, D Ding, AG Marques, A Ribeiro - arxiv preprint arxiv …, 2024 - arxiv.org
We study the problem of computing deterministic optimal policies for constrained Markov
decision processes (MDPs) with continuous state and action spaces, which are widely …

Risk-constrained linear quadratic control with one-step delayed sharing information pattern

Y Wang, W Wang, K Zhang, Y Xu, R Su - Automatica, 2025 - Elsevier
This paper studies the linear quadratic control for non-Gaussian interconnected systems
(ISs). A nonclassical information pattern called delayed sharing pattern is considered. Most …