A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library
The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Distributional reinforcement learning with quantile regression

W Dabney, M Rowland, M Bellemare… - Proceedings of the AAAI …, 2018 - ojs.aaai.org
In reinforcement learning (RL), an agent interacts with the environment by taking actions and
observing the next state and reward. When sampled probabilistically, these state transitions …

Reward constrained policy optimization

C Tessler, DJ Mankowitz, S Mannor - arxiv preprint arxiv:1805.11074, 2018 - arxiv.org
Solving tasks in Reinforcement Learning is no easy feat. As the goal of the agent is to
maximize the accumulated reward, it often learns to exploit loopholes and misspecifications …

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

DD Fan, K Otsu, Y Kubo, A Dixit, J Burdick… - arxiv preprint arxiv …, 2021 - arxiv.org
Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …

Constrained reinforcement learning has zero duality gap

S Paternain, L Chamon… - Advances in Neural …, 2019 - proceedings.neurips.cc
Autonomous agents must often deal with conflicting requirements, such as completing tasks
using the least amount of time/energy, learning multiple tasks, or dealing with multiple …

IPO: Interior-point policy optimization under constraints

Y Liu, J Ding, X Liu - Proceedings of the AAAI conference on artificial …, 2020 - ojs.aaai.org
In this paper, we study reinforcement learning (RL) algorithms to solve real-world decision
problems with the objective of maximizing the long-term reward as well as satisfying …

A review of safe reinforcement learning: Methods, theories and applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Deployment-efficient reinforcement learning via model-based offline optimization

T Matsushima, H Furuta, Y Matsuo, O Nachum… - arxiv preprint arxiv …, 2020 - arxiv.org
Most reinforcement learning (RL) algorithms assume online access to the environment, in
which one may readily interleave updates to the policy with experience collection using that …