Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Statistical learning theory for control: A finite-sample perspective
Learning algorithms have become an integral component to modern engineering solutions.
Examples range from self-driving cars and recommender systems to finance and even …
Examples range from self-driving cars and recommender systems to finance and even …
Neural operators for bypassing gain and control computations in PDE backstep**
We introduce a framework for eliminating the computation of controller gain functions in
partial differential equation (PDE) control. We learn the nonlinear operator from the plant …
partial differential equation (PDE) control. We learn the nonlinear operator from the plant …
[HTML][HTML] Neural operators of backstep** controller and observer gain functions for reaction–diffusion PDEs
Unlike ODEs, whose models involve system matrices and whose controllers involve vector
or matrix gains, PDE models involve functions in those roles—functional coefficients …
or matrix gains, PDE models involve functions in those roles—functional coefficients …
Data-enabled policy optimization for direct adaptive learning of the LQR
Direct data-driven design methods for the linear quadratic regulator (LQR) mainly use offline
or episodic data batches, and their online adaptation has been acknowledged as an open …
or episodic data batches, and their online adaptation has been acknowledged as an open …
On the optimization landscape of dynamic output feedback linear quadratic control
The convergence of policy gradient algorithms hinges on the optimization landscape of the
underlying optimal control problem. Theoretical insights into these algorithms can often be …
underlying optimal control problem. Theoretical insights into these algorithms can often be …
Global convergence of policy gradient primal–dual methods for risk-constrained LQRs
While the techniques in optimal control theory are often model-based, the policy optimization
(PO) approach directly optimizes the performance metric of interest. Even though it has been …
(PO) approach directly optimizes the performance metric of interest. Even though it has been …
Fast trac: A parameter-free optimizer for lifelong reinforcement learning
A key challenge in lifelong reinforcement learning (RL) is the loss of plasticity, where
previous learning progress hinders an agent's adaptation to new tasks. While regularization …
previous learning progress hinders an agent's adaptation to new tasks. While regularization …
Model-free learning with heterogeneous dynamical systems: A federated LQR approach
We study a model-free federated linear quadratic regulator (LQR) problem where M agents
with unknown, distinct yet similar dynamics collaboratively learn an optimal policy to …
with unknown, distinct yet similar dynamics collaboratively learn an optimal policy to …
Revisiting LQR control from the perspective of receding-horizon policy gradient
We revisit in this letter the discrete-time linear quadratic regulator (LQR) problem from the
perspective of receding-horizon policy gradient (RHPG), a newly developed model-free …
perspective of receding-horizon policy gradient (RHPG), a newly developed model-free …
A reinforcement learning look at risk-sensitive linear quadratic gaussian control
In this paper, we propose a robust reinforcement learning method for a class of linear
discrete-time systems to handle model mismatches that may be induced by sim-to-real gap …
discrete-time systems to handle model mismatches that may be induced by sim-to-real gap …