Learning general world models in a handful of reward-free deployments

Y Xu, J Parker-Holder, A Pacchiano… - Advances in …, 2022 - proceedings.neurips.cc
Building generally capable agents is a grand challenge for deep reinforcement learning
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …

Reinforcement learning by guided safe exploration

Q Yang, TD Simão, N Jansen, SH Tindemans… - ECAI 2023, 2023 - ebooks.iospress.nl
Safety is critical to broadening the application of reinforcement learning (RL). Often, we train
RL agents in a controlled environment, such as a laboratory, before deploying them in the …

Exploring the limits of hierarchical world models in reinforcement learning

R Schiewer, A Subramoney, L Wiskott - Scientific Reports, 2024 - nature.com
Hierarchical model-based reinforcement learning (HMBRL) aims to combine the sample
efficiency of model-based reinforcement learning with the abstraction capability of …

Learning to play Atari in a world of tokens

P Agarwal, S Andrews, SE Kahou - arXiv preprint arXiv:2406.01361, 2024 - arxiv.org
Model-based reinforcement learning agents utilizing transformers have shown improved
sample efficiency due to their ability to model extended context, resulting in more accurate …

Training and transferring safe policies in reinforcement learning

Q Yang, T Simão, N Jansen, S Tindemans, M Spaan - 2022 - repository.ubn.ru.nl
Safety is critical to broadening the application of reinforcement learning (RL). Often, RL
agents are trained in a controlled environment, such as a laboratory, before being deployed …

Learning Diverse Skills for Safe Reinforcement Learning

H Cai - 2024 - search.proquest.com
Safety has long been a crucial component in robotic systems, particularly in unstructured
environments where robustness and scalability pose serious challenges to various learning …

Bayesian Partially Observable Reinforcement Learning

S Katt - 2023 - search.proquest.com
Autonomous agents are occupying more roles in our world than ever. They are present as AI
in games, decide on which ads users see on the internet, and are even considered in more …