Adaptive dynamic programming for control: A survey and recent advances

D Liu, S Xue, B Zhao, B Luo… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This article reviews the recent development of adaptive dynamic programming (ADP) with
applications in control. First, its applications in optimal regulation are introduced, and some …

Optimal and autonomous control using reinforcement learning: A survey

B Kiumarsi, KG Vamvoudakis… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
This paper reviews the current state of the art on reinforcement learning (RL)-based
feedback control solutions to optimal regulation and tracking of single and multiagent …

Resilient practical cooperative output regulation for MASs with unknown switching exosystem dynamics under DoS attacks

C Deng, D Zhang, G Feng - Automatica, 2022 - Elsevier
In this paper, the resilient practical cooperative output regulation problem (CORP) is
addressed for heterogeneous linear multi-agent systems with unknown switching exosystem …

Formulas for data-driven control: Stabilization, optimality, and robustness

C De Persis, P Tesi - IEEE Transactions on Automatic Control, 2019 - ieeexplore.ieee.org
In a paper by Willems et al., it was shown that persistently exciting data can be used to
represent the input-output behavior of a linear system. Based on this fundamental result, we …

Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems

X **n, Y Tu, V Stojanovic, H Wang, K Shi, S He… - Applied Mathematics and …, 2022 - Elsevier
In this paper, a novel online mode-free integral reinforcement learning algorithm is proposed
to solve the multiplayer non-zero sum games. We first collect and learn the subsystems …

Hamiltonian-driven adaptive dynamic programming with efficient experience replay

Y Yang, Y Pan, CZ Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
This article presents a novel efficient experience-replay-based adaptive dynamic
programming (ADP) for the optimal control problem of a class of nonlinear dynamical …

Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation

Y Yang, B Kiumarsi, H Modares… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This article presents a model-free-policy iteration (-PI) for the discrete-time linear quadratic
regulation (LQR) problem. To solve the algebraic Riccati equation arising from solving the …

Reinforcement learning for control: Performance, stability, and deep approximators

L Buşoniu, T De Bruin, D Tolić, J Kober… - Annual Reviews in …, 2018 - Elsevier
Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of
systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain …

Safe reinforcement learning: A control barrier function optimization approach

Z Marvi, B Kiumarsi - … Journal of Robust and Nonlinear Control, 2021 - Wiley Online Library
This article presents a learning‐based barrier certified method to learn safe optimal
controllers that guarantee operation of safety‐critical systems within their safe regions while …

Adaptive critic nonlinear robust control: A survey

D Wang, H He, D Liu - IEEE transactions on cybernetics, 2017 - ieeexplore.ieee.org
Adaptive dynamic programming (ADP) and reinforcement learning are quite relevant to each
other when performing intelligent optimization. They are both regarded as promising …