- Academic Search

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Spara Citera Citerat av 297 Relaterade artiklar Alla 2 versionerna Se som HTML-version

[Free GPT-4]

[PDF] arxiv.org

Safe learning in robotics: From learning-based control to safe reinforcement learning

L Brunke, M Greeff, AW Hall, Z Yuan… - Annual Review of …, 2022 - annualreviews.org

The last half decade has seen a steep rise in the number of contributions on safe learning
methods for real-world robotic deployments from both the control and reinforcement learning …

Spara Citera Citerat av 717 Relaterade artiklar Alla 9 versionerna

[Free GPT-4]

[PDF] tor-lattimore.com

[BOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Spara Citera Citerat av 3287 Relaterade artiklar Alla 9 versionerna Bibliotekssökning

[Free GPT-4]

[PDF] neurips.cc

Safe model-based reinforcement learning with stability guarantees

F Berkenkamp, M Turchetta… - Advances in neural …, 2017 - proceedings.neurips.cc

Reinforcement learning is a powerful paradigm for learning optimal policies from
experimental data. However, to find optimal policies, most reinforcement learning algorithms …

Spara Citera Citerat av 1080 Relaterade artiklar Alla 10 versionerna Se som HTML-version

[Free GPT-4]

[PDF] biorxiv.org

A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions

E Schulz, M Speekenbrink, A Krause - Journal of mathematical psychology, 2018 - Elsevier

This tutorial introduces the reader to Gaussian process regression as an expressive tool to
model, actively explore and exploit unknown functions. Gaussian process regression is a …

Spara Citera Citerat av 1442 Relaterade artiklar Alla 11 versionerna

[Free GPT-4]

[PDF] mlr.press

Learning for safety-critical control with control barrier functions

A Taylor, A Singletary, Y Yue… - Learning for Dynamics …, 2020 - proceedings.mlr.press

Modern nonlinear control theory seeks to endow systems with properties of stability and
safety, and have been deployed successfully in multiple domains. Despite this success …

Spara Citera Citerat av 300 Relaterade artiklar Alla 12 versionerna Se som HTML-version

[Free GPT-4]

[PDF] mlr.press

Safe reinforcement learning in constrained markov decision processes

A Wachi, Y Sui - International Conference on Machine …, 2020 - proceedings.mlr.press

Safe reinforcement learning has been a promising approach for optimizing the policy of an
agent that operates in safety-critical applications. In this paper, we propose an algorithm …

Spara Citera Citerat av 199 Relaterade artiklar Alla 16 versionerna Se som HTML-version

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

N Silva, H Werneck, T Silva, ACM Pereira… - Expert Systems with …, 2022 - Elsevier

Abstract Recommender Systems (RSs) have assumed a crucial role in several digital
companies by directly affecting their key performance indicators. Nowadays, in this era of big …

Spara Citera Citerat av 82 Relaterade artiklar Alla 2 versionerna

[Free GPT-4]

[PDF] mlr.press

Provably efficient safe exploration via primal-dual policy optimization

D Ding, X Wei, Z Yang, Z Wang… - … conference on artificial …, 2021 - proceedings.mlr.press

We study the safe reinforcement learning problem using the constrained Markov decision
processes in which an agent aims to maximize the expected total reward subject to a safety …

Spara Citera Citerat av 186 Relaterade artiklar Alla 9 versionerna Se som HTML-version

[Free GPT-4]

[PDF] arxiv.org

Safe controller optimization for quadrotors with Gaussian processes

F Berkenkamp, AP Schoellig… - 2016 IEEE international …, 2016 - ieeexplore.ieee.org

One of the most fundamental problems when designing controllers for dynamic systems is
the tuning of the controller parameters. Typically, a model of the system is used to obtain an …

Spara Citera Citerat av 383 Relaterade artiklar Alla 14 versionerna

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

Safe exploration for optimization with Gaussian processes

A review of safe reinforcement learning: Methods, theory and applications

Safe learning in robotics: From learning-based control to safe reinforcement learning

[BOK][B] Bandit algorithms

Safe model-based reinforcement learning with stability guarantees

A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions

Learning for safety-critical control with control barrier functions

Safe reinforcement learning in constrained markov decision processes

Multi-armed bandits in recommendation systems: A survey of the state-of-the-art and future directions

Provably efficient safe exploration via primal-dual policy optimization

Safe controller optimization for quadrotors with Gaussian processes