Google Академія

Y Xu, J Parker-Holder, A Pacchiano… - Advances in …, 2022 - proceedings.neurips.cc

Building generally capable agents is a grand challenge for deep reinforcement learning
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …

Зберегти Послатися Цитовано в 11 джерелах Пов’язані статті Кількість версій: 7 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] iospress.nl

Reinforcement learning by guided safe exploration

Q Yang, TD Simão, N Jansen, SH Tindemans… - ECAI 2023, 2023 - ebooks.iospress.nl

Safety is critical to broadening the application of reinforcement learning (RL). Often, we train
RL agents in a controlled environment, such as a laboratory, before deploying them in the …

Зберегти Послатися Цитовано в 10 джерелах Пов’язані статті Кількість версій: 9 Пошук бібліотеки

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Exploring the limits of hierarchical world models in reinforcement learning

R Schiewer, A Subramoney, L Wiskott - Scientific Reports, 2024 - nature.com

Hierarchical model-based reinforcement learning (HMBRL) aims to combine the sample
efficiency of model-based reinforcement learning with the abstraction capability of …

Зберегти Послатися Пов’язані статті Кількість версій: 9

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning to play atari in a world of tokens

P Agarwal, S Andrews, SE Kahou - arxiv preprint arxiv:2406.01361, 2024 - arxiv.org

Model-based reinforcement learning agents utilizing transformers have shown improved
sample efficiency due to their ability to model extended context, resulting in more accurate …

Зберегти Послатися Цитовано в 2 джерелах Пов’язані статті Кількість версій: 9 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] ru.nl

[PDF][PDF] Training and transferring safe policies in reinforcement learning

Q Yang, T Simão, N Jansen, S Tindemans, M Spaan - 2022 - repository.ubn.ru.nl

Safety is critical to broadening the application of reinforcement learning (RL). Often, RL
agents are trained in a controlled environment, such as a laboratory, before being deployed …

Зберегти Послатися Цитовано в 8 джерелах Пов’язані статті Кількість версій: 9 Пошук бібліотеки Показати у форматі HTML

Learning Diverse Skills for Safe Reinforcement Learning

H Cai - 2024 - search.proquest.com

Safety has long been a crucial component in robotic systems, particularly in unstructured
environments where robustness and scalability pose serious challenges to various learning …

Зберегти Послатися Пов’язані статті Кількість версій: 2

[Free GPT-4]
[DeepSeek]

[PDF] northeastern.edu

Bayesian Partially Observable Reinforcement Learning

S Katt - 2023 - search.proquest.com

Autonomous agents are occupying more roles in our world than ever. They are present as AI
in games, decide on which ads users see on the internet, and are even considered in more …

Зберегти Послатися Пов’язані статті Кількість версій: 2

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Maximum entropy model-based reinforcement learning

Learning general world models in a handful of reward-free deployments

Reinforcement learning by guided safe exploration

Exploring the limits of hierarchical world models in reinforcement learning

Learning to play atari in a world of tokens

[PDF][PDF] Training and transferring safe policies in reinforcement learning

Learning Diverse Skills for Safe Reinforcement Learning

Bayesian Partially Observable Reinforcement Learning