Continual learning for robotics: Definition, framework, learning strategies, opportunities and challenges

T Lesort, V Lomonaco, A Stoian, D Maltoni, D Filliat… - Information fusion, 2020 - Elsevier
Continual learning (CL) is a particular machine learning paradigm where the data
distribution and learning objective change through time, or where all the training data and …

[HTML][HTML] Continual lifelong learning with neural networks: A review

GI Parisi, R Kemker, JL Part, C Kanan, S Wermter - Neural networks, 2019 - Elsevier
Humans and animals have the ability to continually acquire, fine-tune, and transfer
knowledge and skills throughout their lifespan. This ability, referred to as lifelong learning, is …

GPT-3-driven pedagogical agents to train children's curious question-asking skills

R Abdelghani, YH Wang, X Yuan, T Wang… - International Journal of …, 2024 - Springer
The ability of children to ask curiosity-driven questions is an important skill that helps
improve their learning. For this reason, previous research has explored designing specific …

Agent57: Outperforming the atari human benchmark

AP Badia, B Piot, S Kapturowski… - International …, 2020 - proceedings.mlr.press
Atari games have been a long-standing benchmark in the reinforcement learning (RL)
community for the past decade. This benchmark was proposed to test general competency …

Planning to explore via self-supervised world models

R Sekar, O Rybkin, K Daniilidis… - International …, 2020 - proceedings.mlr.press
Reinforcement learning allows solving complex tasks, however, the learning tends to be task-
specific and the sample efficiency remains a challenge. We present Plan2Explore, a self …

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Exploration by random network distillation

Y Burda, H Edwards, A Storkey, O Klimov - arxiv preprint arxiv …, 2018 - arxiv.org
We introduce an exploration bonus for deep reinforcement learning methods that is easy to
implement and adds minimal overhead to the computation performed. The bonus is the error …

Recurrent world models facilitate policy evolution

D Ha, J Schmidhuber - Advances in neural information …, 2018 - proceedings.neurips.cc
A generative recurrent neural network is quickly trained in an unsupervised manner to
model popular reinforcement learning environments through compressed spatio-temporal …

Dynamics-aware unsupervised discovery of skills

A Sharma, S Gu, S Levine, V Kumar… - arxiv preprint arxiv …, 2019 - arxiv.org
Conventionally, model-based reinforcement learning (MBRL) aims to learn a global model
for the dynamics of the environment. A good model can potentially enable planning …

Diversity is all you need: Learning skills without a reward function

B Eysenbach, A Gupta, J Ibarz, S Levine - arxiv preprint arxiv:1802.06070, 2018 - arxiv.org
Intelligent creatures can explore their environments and learn useful skills without
supervision. In this paper, we propose DIAYN ('Diversity is All You Need'), a method for …