Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Toward a theoretical foundation of policy optimization for learning control policies
Gradient-based methods have been widely used for system design and optimization in
diverse application domains. Recently, there has been a renewed interest in studying …
diverse application domains. Recently, there has been a renewed interest in studying …
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems
Q Wei, T Li - IEEE Transactions on Neural Networks and …, 2023 - ieeexplore.ieee.org
For discrete-time nonlinear systems, this research is concerned with optimal control
problems (OCPs) with constrained cost, and a novel value iteration with constrained cost …
problems (OCPs) with constrained cost, and a novel value iteration with constrained cost …
Global convergence of policy gradient primal–dual methods for risk-constrained LQRs
While the techniques in optimal control theory are often model-based, the policy optimization
(PO) approach directly optimizes the performance metric of interest. Even though it has been …
(PO) approach directly optimizes the performance metric of interest. Even though it has been …
Reinforcement learning with fast stabilization in linear dynamical systems
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable
linear dynamical systems. When learning a dynamical system, one needs to stabilize the …
linear dynamical systems. When learning a dynamical system, one needs to stabilize the …
Augmented rbmle-ucb approach for adaptive control of linear quadratic systems
We consider the problem of controlling an unknown stochastic linear system with quadratic
costs--called the adaptive LQ control problem. We re-examine an approach called``Reward …
costs--called the adaptive LQ control problem. We re-examine an approach called``Reward …
Primal-dual learning for the model-free risk-constrained linear quadratic regulator
Risk-aware control, though with promise to tackle unexpected events, requires a known
exact dynamical model. In this work, we propose a model-free framework to learn a risk …
exact dynamical model. In this work, we propose a model-free framework to learn a risk …
Linear quadratic control with risk constraints
We propose a new risk-constrained formulation of the classical Linear Quadratic (LQ)
stochastic control problem for general partially-observed systems. Our framework is …
stochastic control problem for general partially-observed systems. Our framework is …
Online system identification and control for linear systems with multiagent controllers over wireless interference channels
In this article, we consider identification and control for potentially unstable linear systems
with multiagent controllers in the presence of wireless interference channels among the …
with multiagent controllers in the presence of wireless interference channels among the …
Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs
We study the problem of computing deterministic optimal policies for constrained Markov
decision processes (MDPs) with continuous state and action spaces, which are widely …
decision processes (MDPs) with continuous state and action spaces, which are widely …
Risk-constrained linear quadratic control with one-step delayed sharing information pattern
This paper studies the linear quadratic control for non-Gaussian interconnected systems
(ISs). A nonclassical information pattern called delayed sharing pattern is considered. Most …
(ISs). A nonclassical information pattern called delayed sharing pattern is considered. Most …