- Academic Search

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Gem Citer Citeret af 300 Relaterede artikler Alle 2 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] wiley.com

Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library

The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Gem Citer Citeret af 212 Relaterede artikler Alle 13 versioner

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Distributional reinforcement learning with quantile regression

W Dabney, M Rowland, M Bellemare… - Proceedings of the AAAI …, 2018 - ojs.aaai.org

In reinforcement learning (RL), an agent interacts with the environment by taking actions and
observing the next state and reward. When sampled probabilistically, these state transitions …

Gem Citer Citeret af 945 Relaterede artikler Alle 8 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reward constrained policy optimization

C Tessler, DJ Mankowitz, S Mannor - arxiv preprint arxiv:1805.11074, 2018 - arxiv.org

Solving tasks in Reinforcement Learning is no easy feat. As the goal of the agent is to
maximize the accumulated reward, it often learns to exploit loopholes and misspecifications …

Gem Citer Citeret af 628 Relaterede artikler Alle 4 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org

A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

Gem Citer Citeret af 446 Relaterede artikler Alle 18 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

DD Fan, K Otsu, Y Kubo, A Dixit, J Burdick… - arxiv preprint arxiv …, 2021 - arxiv.org

Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …

Gem Citer Citeret af 138 Relaterede artikler Alle 7 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Constrained reinforcement learning has zero duality gap

S Paternain, L Chamon… - Advances in Neural …, 2019 - proceedings.neurips.cc

Autonomous agents must often deal with conflicting requirements, such as completing tasks
using the least amount of time/energy, learning multiple tasks, or dealing with multiple …

Gem Citer Citeret af 215 Relaterede artikler Alle 8 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

IPO: Interior-point policy optimization under constraints

Y Liu, J Ding, X Liu - Proceedings of the AAAI conference on artificial …, 2020 - ojs.aaai.org

In this paper, we study reinforcement learning (RL) algorithms to solve real-world decision
problems with the objective of maximizing the long-term reward as well as satisfying …

Gem Citer Citeret af 209 Relaterede artikler Alle 10 versioner Vis som HTML

[Free GPT-4]
[DeepSeek]

[PDF] kcl.ac.uk

A review of safe reinforcement learning: Methods, theories and applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Gem Citer Citeret af 16 Relaterede artikler Alle 8 versioner

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deployment-efficient reinforcement learning via model-based offline optimization

T Matsushima, H Furuta, Y Matsuo, O Nachum… - arxiv preprint arxiv …, 2020 - arxiv.org

Most reinforcement learning (RL) algorithms assume online access to the environment, in
which one may readily interleave updates to the policy with experience collection using that …

Gem Citer Citeret af 162 Relaterede artikler Alle 8 versioner Vis som HTML

Opret underretning

Citer

Avanceret søgning

Gemt i Min samling

Risk-sensitive and robust decision-making: a cvar optimization approach

A review of safe reinforcement learning: Methods, theory and applications

Recent advances in reinforcement learning in finance

Distributional reinforcement learning with quantile regression

Reward constrained policy optimization

A review of robot learning for manipulation: Challenges, representations, and algorithms

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

Constrained reinforcement learning has zero duality gap

IPO: Interior-point policy optimization under constraints

A review of safe reinforcement learning: Methods, theories and applications

Deployment-efficient reinforcement learning via model-based offline optimization