- Academic Search

L Buşoniu, T De Bruin, D Tolić, J Kober… - Annual Reviews in …, 2018 - Elsevier

Reinforcement learning (RL) offers powerful algorithms to search for optimal controllers of
systems with nonlinear, possibly stochastic dynamics that are unknown or highly uncertain …

Lagre Referanse Sitert av 498 Beslektede artikler Alle 9 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

On the sample complexity of the linear quadratic regulator

S Dean, H Mania, N Matni, B Recht, S Tu - Foundations of Computational …, 2020 - Springer

This paper addresses the optimal control problem known as the linear quadratic regulator in
the case when the dynamics are unknown. We propose a multistage procedure, called …

Lagre Referanse Sitert av 661 Beslektede artikler Alle 11 versjoner

An approximate neuro-optimal solution of discounted guaranteed cost control design

D Wang, J Qiao, L Cheng - IEEE Transactions on Cybernetics, 2020 - ieeexplore.ieee.org

The adaptive optimal feedback stabilization is investigated in this article for discounted
guaranteed cost control of uncertain nonlinear dynamical systems. Via theoretical analysis …

Lagre Referanse Sitert av 140 Beslektede artikler Alle 3 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Actor-critic reinforcement learning for control with stability guarantee

M Han, L Zhang, J Wang, W Pan - IEEE Robotics and …, 2020 - ieeexplore.ieee.org

Reinforcement Learning (RL) and its integration with deep learning have achieved
impressive performance in various robotic control tasks, ranging from motion planning and …

Lagre Referanse Sitert av 122 Beslektede artikler Alle 11 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] ubc.ca

Deep reinforcement learning approaches for process control

SPK Spielberg, RB Gopaluni… - 2017 6th international …, 2017 - ieeexplore.ieee.org

In this work, we have extended the current success of deep learning and reinforcement
learning to process control problems. We have shown that if reward hypothesis functions are …

Lagre Referanse Sitert av 190 Beslektede artikler Alle 11 versjoner

Optimal tracking control of nonlinear multiagent systems using internal reinforce Q-learning

Z Peng, R Luo, J Hu, K Shi, SK Nguang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

In this article, a novel reinforcement learning (RL) method is developed to solve the optimal
tracking control problem of unknown nonlinear multiagent systems (MASs). Different from …

Lagre Referanse Sitert av 85 Beslektede artikler Alle 3 versjoner

Offline and online adaptive critic control designs with stability guarantee through value iteration

M Ha, D Wang, D Liu - IEEE Transactions on Cybernetics, 2021 - ieeexplore.ieee.org

This article is concerned with the stability of the closed-loop system using various control
policies generated by value iteration. Some stability properties involving admissibility …

Lagre Referanse Sitert av 53 Beslektede artikler Alle 3 versjoner

Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control

SAA Rizvi, Z Lin - Automatica, 2018 - Elsevier

Approximate dynamic programming techniques usually rely on the feedback of the
measurement of the complete state, which is generally not available in practical situations. In …

Lagre Referanse Sitert av 115 Beslektede artikler Alle 4 versjoner

A novel value iteration scheme with adjustable convergence rate

M Ha, D Wang, D Liu - IEEE Transactions on Neural Networks …, 2022 - ieeexplore.ieee.org

In this article, a novel value iteration scheme is developed with convergence and stability
discussions. A relaxation factor is introduced to adjust the convergence rate of the value …

Lagre Referanse Sitert av 44 Beslektede artikler Alle 4 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

Reinforcement learning-based linear quadratic regulation of continuous-time systems using dynamic output feedback

SAA Rizvi, Z Lin - IEEE transactions on cybernetics, 2019 - ieeexplore.ieee.org

In this paper, we propose a model-free solution to the linear quadratic regulation (LQR)
problem of continuous-time systems based on reinforcement learning using dynamic output …

Lagre Referanse Sitert av 100 Beslektede artikler Alle 4 versjoner

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Stability analysis of discrete-time infinite-horizon optimal control with discounted cost

Reinforcement learning for control: Performance, stability, and deep approximators

On the sample complexity of the linear quadratic regulator

An approximate neuro-optimal solution of discounted guaranteed cost control design

Actor-critic reinforcement learning for control with stability guarantee

Deep reinforcement learning approaches for process control

Optimal tracking control of nonlinear multiagent systems using internal reinforce Q-learning

Offline and online adaptive critic control designs with stability guarantee through value iteration

Output feedback Q-learning for discrete-time linear zero-sum games with application to the H-infinity control

A novel value iteration scheme with adjustable convergence rate

Reinforcement learning-based linear quadratic regulation of continuous-time systems using dynamic output feedback