The intelligent critic framework for advanced optimal control

D Wang, M Ha, M Zhao - Artificial Intelligence Review, 2022 - Springer
The idea of optimization can be regarded as an important basis of many disciplines and
hence is extremely useful for a large number of research fields, particularly for artificial …

Hamiltonian-driven adaptive dynamic programming with efficient experience replay

Y Yang, Y Pan, CZ Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
This article presents a novel efficient experience-replay-based adaptive dynamic
programming (ADP) for the optimal control problem of a class of nonlinear dynamical …

Model-Free λ-Policy Iteration for Discrete-Time Linear Quadratic Regulation

Y Yang, B Kiumarsi, H Modares… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This article presents a model-free-policy iteration (-PI) for the discrete-time linear quadratic
regulation (LQR) problem. To solve the algebraic Riccati equation arising from solving the …

Robust actor–critic learning for continuous-time nonlinear systems with unmodeled dynamics

Y Yang, W Gao, H Modares… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
This article considers the robust optimal control problem for a class of nonlinear systems in
the presence of unmodeled dynamics. An adaptive optimal controller is designed using the …

Hamiltonian-driven adaptive dynamic programming with approximation errors

Y Yang, H Modares, KG Vamvoudakis… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
In this article, we consider an iterative adaptive dynamic programming (ADP) algorithm
within the Hamiltonian-driven framework to solve the Hamilton–Jacobi–Bellman (HJB) …

Sampled-data consensus protocols for a class of second-order switched nonlinear multiagent systems

W Zou, J Guo, CK Ahn, Z **ang - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this study, the sampled-data consensus problem is investigated for a class of
heterogeneous multiagent systems (MASs) in which each agent is described by a second …

Reinforcement-learning-based tracking control of waste water treatment process under realistic system conditions and control performance requirements

Q Yang, W Cao, W Meng, J Si - IEEE Transactions on Systems …, 2021 - ieeexplore.ieee.org
The tracking control of a wastewater treatment process (WWTP) is considered. The process
is highly nonlinear, with strong coupling, difficult to model mathematically, and the operation …

Collision-free path planning for welding manipulator via hybrid algorithm of deep reinforcement learning and inverse kinematics

J Zhong, T Wang, L Cheng - Complex & Intelligent Systems, 2021 - Springer
In actual welding scenarios, an effective path planner is needed to find a collision-free path
in the configuration space for the welding manipulator with obstacles around. However, as a …

Safe reinforcement learning for dynamical games

Y Yang, KG Vamvoudakis… - International Journal of …, 2020 - Wiley Online Library
This article presents a novel actor‐critic‐barrier structure for the multiplayer safety‐critical
systems. Non‐zero‐sum (NZS) games with full‐state constraints are first transformed into …

Adaptive dynamic event-triggered bipartite time-varying output formation tracking problem of heterogeneous multiagent systems

J Zhang, H Zhang, S Sun - IEEE Transactions on Systems, Man …, 2023 - ieeexplore.ieee.org
This article investigates the bipartite time-varying output formation tracking problem of
heterogeneous linear multiagent systems with disturbances by adaptive dynamic event …