Академия Google

J Garcıa, F Fernández - Journal of Machine Learning Research, 2015 - jmlr.org

Abstract Safe Reinforcement Learning can be defined as the process of learning policies
that maximize the expectation of the return in problems in which it is important to ensure …

Сохранить Цитировать Цитируется: 2105 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] jair.org

A survey on transfer learning for multiagent reinforcement learning systems

FL Da Silva, AHR Costa - Journal of Artificial Intelligence Research, 2019 - jair.org

Multiagent Reinforcement Learning (RL) solves complex tasks that require coordination with
other agents through autonomous exploration of the environment. However, learning a …

Сохранить Цитировать Цитируется: 339 Похожие статьи Все версии статьи (10) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

The importance of pessimism in fixed-dataset policy optimization

J Buckman, C Gelada, MG Bellemare - arxiv preprint arxiv:2009.06799, 2020 - arxiv.org

We study worst-case guarantees on the expected return of fixed-dataset policy optimization
algorithms. Our core contribution is a unified conceptual and mathematical framework for the …

Сохранить Цитировать Цитируется: 162 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Agent-agnostic human-in-the-loop reinforcement learning

D Abel, J Salvatier, A Stuhlmüller, O Evans - arxiv preprint arxiv …, 2017 - arxiv.org

Providing Reinforcement Learning agents with expert advice can dramatically improve
various aspects of learning. Prior work has developed teaching protocols that enable agents …

Сохранить Цитировать Цитируется: 86 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

A view of margin losses as regularizers of probability estimates

H Masnadi-Shirazi, N Vasconcelos - The Journal of Machine Learning …, 2015 - dl.acm.org

Regularization is commonly used in classifier design, to assure good generalization.
Classical regularization enforces a cost on classifier complexity, by constraining parameters …

Сохранить Цитировать Цитируется: 16 Похожие статьи Все версии статьи (7) Поиск в библиотеках

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning under algorithmic triage

E Straitouri, A Singla, VB Meresht… - arxiv preprint arxiv …, 2021 - arxiv.org

Methods to learn under algorithmic triage have predominantly focused on supervised
learning settings where each decision, or prediction, is independent of each other. Under …

Сохранить Цитировать Цитируется: 17 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

Relational reinforcement learning for planning with exogenous effects

D Mart, G Aleny, T Ribeiro, K Inoue, C Torras - Journal of Machine …, 2017 - jmlr.org

Probabilistic planners have improved recently to the point that they can solve difficult tasks
with complex and expressive models. In contrast, learners cannot tackle yet the expressive …

Сохранить Цитировать Цитируется: 33 Похожие статьи Все версии статьи (12) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mpg.de

[PDF][PDF] Learning to switch among agents in a team

V Balazadeh Meresht, A De, A Singla… - … on Machine Learning …, 2022 - pure.mpg.de

Reinforcement learning agents have been mostly developed and evaluated under the
assumption that they will operate in a fully autonomous manner—they will take all actions. In …

Сохранить Цитировать Цитируется: 3 Похожие статьи В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Scheduled policy optimization for natural language communication with intelligent agents

W **ong, X Guo, M Yu, S Chang, B Zhou… - arxiv preprint arxiv …, 2018 - arxiv.org

We investigate the task of learning to follow natural language instructions by jointly
reasoning with visual observations and language inputs. In contrast to existing methods …

Сохранить Цитировать Цитируется: 11 Похожие статьи Все версии статьи (9) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] ijcai.org

Transfer learning for multiagent reinforcement learning systems [J]

F Silva, A Costa - Synthesis Lectures on Artificial Intelligence and …, 2021 - Springer

Learning to solve sequential decision-making tasks is difficult. Humans take years exploring
the environment essentially in a random way until they are able to reason, solve difficult …

Сохранить Цитировать Цитируется: 9 Похожие статьи Все версии статьи (7) Поиск в библиотеках

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Blending autonomous exploration and apprenticeship learning

[PDF][PDF] A comprehensive survey on safe reinforcement learning

A survey on transfer learning for multiagent reinforcement learning systems

The importance of pessimism in fixed-dataset policy optimization

Agent-agnostic human-in-the-loop reinforcement learning

A view of margin losses as regularizers of probability estimates

Reinforcement learning under algorithmic triage

Relational reinforcement learning for planning with exogenous effects

[PDF][PDF] Learning to switch among agents in a team

Scheduled policy optimization for natural language communication with intelligent agents

Transfer learning for multiagent reinforcement learning systems [J]