A survey on offline reinforcement learning: Taxonomy, review, and open problems

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

Minigrid & miniworld: Modular & customizable reinforcement learning environments for goal-oriented tasks

M Chevalier-Boisvert, B Dai… - Advances in …, 2023 - proceedings.neurips.cc
We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D
and 3D environments. The libraries were explicitly created with a minimalistic design …

Learning agile soccer skills for a bipedal robot with deep reinforcement learning

T Haarnoja, B Moran, G Lever, SH Huang… - Science Robotics, 2024 - science.org
We investigated whether deep reinforcement learning (deep RL) is able to synthesize
sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be …

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arxiv preprint arxiv …, 2022 - arxiv.org
Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

Roco: Dialectic multi-robot collaboration with large language models

Z Mandi, S Jain, S Song - 2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
We propose a novel approach to multi-robot collaboration that harnesses the power of pre-
trained large language models (LLMs) for both high-level communication and low-level path …

The primacy bias in deep reinforcement learning

E Nikishin, M Schwarzer, P D'Oro… - International …, 2022 - proceedings.mlr.press
This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …

Safety gymnasium: A unified safe reinforcement learning benchmark

J Ji, B Zhang, J Zhou, X Pan… - Advances in …, 2023 - proceedings.neurips.cc
Artificial intelligence (AI) systems possess significant potential to drive societal progress.
However, their deployment often faces obstacles due to substantial safety concerns. Safe …

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press
Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Synthetic experience replay

C Lu, P Ball, YW Teh… - Advances in Neural …, 2023 - proceedings.neurips.cc
A key theme in the past decade has been that when large neural networks and large
datasets combine they can produce remarkable results. In deep reinforcement learning (RL) …

robosuite: A modular simulation framework and benchmark for robot learning

Y Zhu, J Wong, A Mandlekar, R Martín-Martín… - arxiv preprint arxiv …, 2020 - arxiv.org
robosuite is a simulation framework for robot learning powered by the MuJoCo physics
engine. It offers a modular design for creating robotic tasks as well as a suite of benchmark …