A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org
The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

[BOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com
Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Safe model-based reinforcement learning with stability guarantees

F Berkenkamp, M Turchetta… - Advances in neural …, 2017 - proceedings.neurips.cc
Reinforcement learning is a powerful paradigm for learning optimal policies from
experimental data. However, to find optimal policies, most reinforcement learning algorithms …

A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions

E Schulz, M Speekenbrink, A Krause - Journal of mathematical psychology, 2018 - Elsevier
This tutorial introduces the reader to Gaussian process regression as an expressive tool to
model, actively explore and exploit unknown functions. Gaussian process regression is a …

Learning for safety-critical control with control barrier functions

A Taylor, A Singletary, Y Yue… - Learning for Dynamics …, 2020 - proceedings.mlr.press
Modern nonlinear control theory seeks to endow systems with properties of stability and
safety, and have been deployed successfully in multiple domains. Despite this success …

Safe reinforcement learning in constrained markov decision processes

A Wachi, Y Sui - International Conference on Machine …, 2020 - proceedings.mlr.press
Safe reinforcement learning has been a promising approach for optimizing the policy of an
agent that operates in safety-critical applications. In this paper, we propose an algorithm …

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier
Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Provably efficient safe exploration via primal-dual policy optimization

D Ding, X Wei, Z Yang, Z Wang… - … conference on artificial …, 2021 - proceedings.mlr.press
We study the safe reinforcement learning problem using the constrained Markov decision
processes in which an agent aims to maximize the expected total reward subject to a safety …

Safe controller optimization for quadrotors with Gaussian processes

F Berkenkamp, AP Schoellig… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org
One of the most fundamental problems when designing controllers for dynamic systems is
the tuning of the controller parameters. Typically, a model of the system is used to obtain an …