- Academic Search

Save Cite Cited by 988 Related articles All 4 versions Free GPT-4 View as HTML

A generalist agent

S Reed, K Zolna, E Parisotto, SG Colmenarejo… - arxiv preprint arxiv …, 2022 - arxiv.org

Inspired by progress in large-scale language modeling, we apply a similar approach
towards building a single generalist agent beyond the realm of text outputs. The agent …

Save Cite Cited by 140 Related articles All 6 versions Free GPT-4 View as HTML

Contrastive learning as goal-conditioned reinforcement learning

B Eysenbach, T Zhang, S Levine… - Advances in Neural …, 2022 - proceedings.neurips.cc

In reinforcement learning (RL), it is easier to solve a task if given a good representation.
While deep RL should automatically acquire such good representations, prior work often …

Video prediction models as rewards for reinforcement learning

A Escontrela, A Adeniji, W Yan, A Jain… - Advances in …, 2024 - proceedings.neurips.cc

Specifying reward signals that allow agents to learn complex behaviors is a long-standing
challenge in reinforcement learning. A promising approach is to extract preferences for …

Save Cite Cited by 49 Related articles All 8 versions Free GPT-4 View as HTML

Ceil: Generalized contextual imitation learning

J Liu, L He, Y Kang, Z Zhuang… - Advances in Neural …, 2023 - proceedings.neurips.cc

In this paper, we present ContExtual Imitation Learning (CEIL), a general and broadly
applicable algorithm for imitation learning (IL). Inspired by the formulation of hindsight …

Save Cite Cited by 17 Related articles All 5 versions Free GPT-4 View as HTML

Fusion dynamical systems with machine learning in imitation learning: A comprehensive overview

Y Hu, FJ Abu-Dakka, F Chen, X Luo, Z Li, A Knoll… - Information …, 2024 - Elsevier

Imitation Learning (IL), also referred to as Learning from Demonstration (LfD), holds
significant promise for capturing expert motor skills through efficient imitation, facilitating …

Save Cite Cited by 3 Related articles All 4 versions Free GPT-4

Offline learning from demonstrations and unlabeled experience

K Zolna, A Novikov, K Konyushkova… - arxiv preprint arxiv …, 2020 - arxiv.org

Behavior cloning (BC) is often practical for robot learning because it allows a policy to be
trained offline without rewards, by supervised learning on expert demonstrations. However …

Save Cite Cited by 73 Related articles All 3 versions Free GPT-4 View as HTML

Imitating interactive intelligence

J Abramson, A Ahuja, I Barr, A Brussee… - arxiv preprint arxiv …, 2020 - arxiv.org

A common vision from science fiction is that robots will one day inhabit our physical spaces,
sense the world as we do, assist our physical labours, and communicate with us through …

Save Cite Cited by 74 Related articles All 2 versions Free GPT-4 View as HTML

Making efficient use of demonstrations to solve hard exploration problems

TL Paine, C Gulcehre, B Shahriari, M Denil… - arxiv preprint arxiv …, 2019 - arxiv.org

This paper introduces R2D3, an agent that makes efficient use of demonstrations to solve
hard exploration problems in partially observable environments with highly variable initial …

Save Cite Cited by 99 Related articles All 5 versions Free GPT-4 View as HTML