Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A social path to human-like artificial intelligence

EA Duéñez-Guzmán, S Sadedin, JX Wang… - Nature machine …, 2023 - nature.com
Traditionally, cognitive and computer scientists have viewed intelligence solipsistically, as a
property of unitary agents devoid of social context. Given the success of contemporary …

Scaling up and distilling down: Language-guided robot skill acquisition

H Ha, P Florence, S Song - Conference on Robot Learning, 2023 - proceedings.mlr.press
We present a framework for robot skill acquisition, which 1) efficiently scale up data
generation of language-labelled robot data and 2) effectively distills this data down into a …

Learning quadrupedal locomotion over challenging terrain

J Lee, J Hwangbo, L Wellhausen, V Koltun, M Hutter - Science robotics, 2020 - science.org
Legged locomotion can extend the operational domain of robots to some of the most
challenging environments on Earth. However, conventional controllers for legged …

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Emergent tool use from multi-agent autocurricula

B Baker, I Kanitscheider, T Markov, Y Wu… - International …, 2019 - openreview.net
Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …

Evolving curricula with regret-based environment design

J Parker-Holder, M Jiang, M Dennis… - International …, 2022 - proceedings.mlr.press
Training generally-capable agents with reinforcement learning (RL) remains a significant
challenge. A promising avenue for improving the robustness of RL agents is through the use …

Unity: A general platform for intelligent agents

A Juliani, VP Berges, E Teng, A Cohen… - arxiv preprint arxiv …, 2018 - arxiv.org
Recent advances in artificial intelligence have been driven by the presence of increasingly
realistic and complex simulated environments. However, many of the existing environments …

Human-timescale adaptation in an open-ended task space

AA Team, J Bauer, K Baumli, S Baveja… - arxiv preprint arxiv …, 2023 - arxiv.org
Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …

Gensim: Generating robotic simulation tasks via large language models

L Wang, Y Ling, Z Yuan, M Shridhar, C Bao… - arxiv preprint arxiv …, 2023 - arxiv.org
Collecting large amounts of real-world interaction data to train general robotic policies is
often prohibitively expensive, thus motivating the use of simulation data. However, existing …