A survey on deep reinforcement learning algorithms for robotic manipulation

D Han, B Mulyana, V Stankovic, S Cheng - Sensors, 2023 - mdpi.com
Robotic manipulation challenges, such as gras** and object manipulation, have been
tackled successfully with the help of deep reinforcement learning systems. We give an …

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arxiv preprint arxiv …, 2022 - arxiv.org
Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc
In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

Video prediction models as rewards for reinforcement learning

A Escontrela, A Adeniji, W Yan, A Jain… - Advances in …, 2024 - proceedings.neurips.cc
Specifying reward signals that allow agents to learn complex behaviors is a long-standing
challenge in reinforcement learning. A promising approach is to extract preferences for …

Ceil: Generalized contextual imitation learning

J Liu, L He, Y Kang, Z Zhuang… - Advances in Neural …, 2023 - proceedings.neurips.cc
In this paper, we present ContExtual Imitation Learning (CEIL), a general and broadly
applicable algorithm for imitation learning (IL). Inspired by the formulation of hindsight …

Fusion dynamical systems with machine learning in imitation learning: A comprehensive overview

Y Hu, FJ Abu-Dakka, F Chen, X Luo, Z Li, A Knoll… - Information …, 2024 - Elsevier
Imitation Learning (IL), also referred to as Learning from Demonstration (LfD), holds
significant promise for capturing expert motor skills through efficient imitation, facilitating …

Offline learning from demonstrations and unlabeled experience

K Zolna, A Novikov, K Konyushkova… - arxiv preprint arxiv …, 2020 - arxiv.org
Behavior cloning (BC) is often practical for robot learning because it allows a policy to be
trained offline without rewards, by supervised learning on expert demonstrations. However …

Imitating interactive intelligence

J Abramson, A Ahuja, I Barr, A Brussee… - arxiv preprint arxiv …, 2020 - arxiv.org
A common vision from science fiction is that robots will one day inhabit our physical spaces,
sense the world as we do, assist our physical labours, and communicate with us through …

Making efficient use of demonstrations to solve hard exploration problems

TL Paine, C Gulcehre, B Shahriari, M Denil… - arxiv preprint arxiv …, 2019 - arxiv.org
This paper introduces R2D3, an agent that makes efficient use of demonstrations to solve
hard exploration problems in partially observable environments with highly variable initial …

Generalizable imitation learning from observation via inferring goal proximity

Y Lee, A Szot, SH Sun, JJ Lim - Advances in neural …, 2021 - proceedings.neurips.cc
Task progress is intuitive and readily available task information that can guide an agent
closer to the desired goal. Furthermore, a task progress estimator can generalize to new …