محقق Google

D Ding, CY Wei, K Zhang… - Advances in Neural …, 2023‏ - proceedings.neurips.cc‏

We study the problem of computing an optimal policy of an infinite-horizon discounted
constrained Markov decision process (constrained MDP). Despite the popularity of …‏

ذخیره ارجاع بیان شده در 27 یافته مقاله‌های مربوط تمام نسخه‌های 8 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] jair.org Full View‏

Structure in deep reinforcement learning: A survey and open problems‏

A Mohan, A Zhang, M Lindauer - Journal of Artificial Intelligence Research, 2024‏ - jair.org‏

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …‏

ذخیره ارجاع بیان شده در 23 یافته مقاله‌های مربوط تمام نسخه‌های 10 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Safety-constrained reinforcement learning with a distributional safety critic‏

Q Yang, TD Simão, SH Tindemans, MTJ Spaan - Machine Learning, 2023‏ - Springer‏

Safety is critical to broadening the real-world use of reinforcement learning. Modeling the
safety aspects using a safety-cost signal separate from the reward and bounding the …‏

ذخیره ارجاع بیان شده در 49 یافته مقاله‌های مربوط تمام نسخه‌های 13

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Reload: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained mdps‏

T Moskovitz, B O'Donoghue, V Veeriah… - International …, 2023‏ - proceedings.mlr.press‏

In recent years, reinforcement learning (RL) has been applied to real-world problems with
increasing success. Such applications often require to put constraints on the agent's …‏

ذخیره ارجاع بیان شده در 22 یافته مقاله‌های مربوط تمام نسخه‌های 8 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

DOPE: Doubly optimistic and pessimistic exploration for safe reinforcement learning‏

A Bura, A HasanzadeZonuzy… - Advances in neural …, 2022‏ - proceedings.neurips.cc‏

Safe reinforcement learning is extremely challenging--not only must the agent explore an
unknown environment, it must do so while ensuring no safety constraint violations. We …‏

ذخیره ارجاع بیان شده در 40 یافته مقاله‌های مربوط تمام نسخه‌های 7 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Verification-Guided Shielding for Deep Reinforcement Learning‏

D Corsi, G Amir, A Rodríguez, C Sánchez… - arxiv preprint arxiv …, 2024‏ - arxiv.org‏

In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach
to solving real-world tasks. However, despite their successes, DRL-based policies suffer …‏

ذخیره ارجاع بیان شده در 7 یافته مقاله‌های مربوط تمام نسخه‌های 10 نسخه HTML

Autonomous driving based on approximate safe action‏

X Wang, J Zhang, D Hou… - IEEE Transactions on …, 2023‏ - ieeexplore.ieee.org‏

Safety limits the application of traditional reinforcement learning (RL) methods to
autonomous driving. To address the challenge of safe exploration in autonomous driving …‏

ذخیره ارجاع بیان شده در 15 یافته مقاله‌های مربوط تمام نسخه‌های 3

[Free GPT-4]
[DeepSeek]

[PDF] jair.org Full View‏

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms‏

F De Nijs, E Walraven, M De Weerdt, M Spaan - Journal of Artificial …, 2021‏ - jair.org‏

In domains such as electric vehicle charging, smart distribution grids and autonomous
warehouses, multiple agents share the same resources. When planning the use of these …‏

ذخیره ارجاع بیان شده در 41 یافته مقاله‌های مربوط تمام نسخه‌های 12 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Enhancing safe exploration using safety state augmentation‏

A Sootla, A Cowen-Rivers, J Wang… - Advances in Neural …, 2022‏ - proceedings.neurips.cc‏

Safe exploration is a challenging and important problem in model-free reinforcement
learning (RL). Often the safety cost is sparse and unknown, which unavoidably leads to …‏

ذخیره ارجاع بیان شده در 18 یافته مقاله‌های مربوط تمام نسخه‌های 3 نسخه HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Safe reinforcement learning with natural language constraints‏

TY Yang, MY Hu, Y Chow… - Advances in …, 2021‏ - proceedings.neurips.cc‏

While safe reinforcement learning (RL) holds great promise for many practical applications
like robotics or autonomous cars, current approaches require specifying constraints in …‏

ذخیره ارجاع بیان شده در 39 یافته مقاله‌های مربوط تمام نسخه‌های 8 نسخه HTML

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

AlwaysSafe: Reinforcement learning without safety constraint violations during training

Last-iterate convergent policy gradient primal-dual methods for constrained mdps‏

Structure in deep reinforcement learning: A survey and open problems‏

Safety-constrained reinforcement learning with a distributional safety critic‏

Reload: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained mdps‏

DOPE: Doubly optimistic and pessimistic exploration for safe reinforcement learning‏

Verification-Guided Shielding for Deep Reinforcement Learning‏

Autonomous driving based on approximate safe action‏

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms‏

Enhancing safe exploration using safety state augmentation‏

Safe reinforcement learning with natural language constraints‏