Recent advances in applying deep reinforcement learning for flow control: Perspectives and future directions

C Vignon, J Rabault, R Vinuesa - Physics of fluids, 2023 - pubs.aip.org
Deep reinforcement learning (DRL) has been applied to a variety of problems during the
past decade and has provided effective control strategies in high-dimensional and non …

How to train your robot with deep reinforcement learning: lessons we have learned

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Learning to walk via deep reinforcement learning

T Haarnoja, S Ha, A Zhou, J Tan, G Tucker… - arxiv preprint arxiv …, 2018 - arxiv.org
Deep reinforcement learning (deep RL) holds the promise of automating the acquisition of
complex controllers that can map sensory inputs directly to low-level actions. In the domain …

Safe model-based reinforcement learning with stability guarantees

F Berkenkamp, M Turchetta… - Advances in neural …, 2017 - proceedings.neurips.cc
Reinforcement learning is a powerful paradigm for learning optimal policies from
experimental data. However, to find optimal policies, most reinforcement learning algorithms …

Model-ensemble trust-region policy optimization

T Kurutach, I Clavera, Y Duan, A Tamar… - arxiv preprint arxiv …, 2018 - arxiv.org
Model-free reinforcement learning (RL) methods are succeeding in a growing number of
tasks, aided by recent advances in deep learning. However, they tend to suffer from high …

Information theoretic MPC for model-based reinforcement learning

G Williams, N Wagener, B Goldfain… - … on robotics and …, 2017 - ieeexplore.ieee.org
We introduce an information theoretic model predictive control (MPC) algorithm capable of
handling complex cost criteria and general nonlinear dynamics. The generality of the …

Continuous deep q-learning with model-based acceleration

S Gu, T Lillicrap, I Sutskever… - … conference on machine …, 2016 - proceedings.mlr.press
Abstract Model-free reinforcement learning has been successfully applied to a range of
challenging problems, and has recently been extended to handle large neural network …

Global optimality guarantees for policy gradient methods

J Bhandari, D Russo - Operations Research, 2024 - pubsonline.informs.org
Policy gradients methods apply to complex, poorly understood, control problems by
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …

[Књига][B] Lifelong machine learning

Z Chen, B Liu - 2018 - books.google.com
Lifelong Machine Learning, Second Edition is an introduction to an advanced machine
learning paradigm that continuously learns by accumulating past knowledge that it then …