- Academic Search

Libero: Benchmarking knowledge transfer for lifelong robot learning

B Liu, Y Zhu, C Gao, Y Feng, Q Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc

Lifelong learning offers a promising paradigm of building a generalist agent that learns and
adapts over its lifespan. Unlike traditional lifelong learning problems in image and text …

Cited by 80, 7 versions

[PDF] jair.org

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Cited by 221, 9 versions

[PDF] mlr.press

Evolving curricula with regret-based environment design

J Parker-Holder, M Jiang, M Dennis… - International …, 2022 - proceedings.mlr.press

Training generally-capable agents with reinforcement learning (RL) remains a significant
challenge. A promising avenue for improving the robustness of RL agents is through the use …

Cited by 133, 6 versions

[PDF] neurips.cc

Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning

B Ellis, J Cook, S Moalla… - Advances in …, 2023 - proceedings.neurips.cc

The availability of challenging benchmarks has played a key role in the recent progress of
machine learning. In cooperative multi-agent reinforcement learning, the StarCraft Multi …

Cited by 104, 8 versions

[PDF] arxiv.org

Human-timescale adaptation in an open-ended task space

AA Team, J Bauer, K Baumli, S Baveja… - arxiv preprint arxiv …, 2023 - arxiv.org

Foundation models have shown impressive adaptation and scalability in supervised and
self-supervised learning problems, but so far these successes have not fully translated to …

Cited by 89, 2 versions

[PDF] mlr.press

Human-timescale adaptation in an open-ended task space

J Bauer, K Baumli, F Behbahani… - International …, 2023 - proceedings.mlr.press

Foundation models have shown impressive adaptation and scalability in supervised and
self-supervised learning problems, but so far these successes have not fully translated to …

Cited by 39, 4 versions

[PDF] arxiv.org

Motif: Intrinsic motivation from artificial intelligence feedback

M Klissarov, P D'Oro, S Sodhani, R Raileanu… - arxiv preprint arxiv …, 2023 - arxiv.org

Exploring rich environments and evaluating one's actions without prior knowledge is
immensely challenging. In this paper, we propose Motif, a general method to interface such …

Cited by 56, 4 versions

[PDF] mlr.press

A generalist neural algorithmic learner

B Ibarz, V Kurin, G Papamakarios… - Learning on graphs …, 2022 - proceedings.mlr.press

The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks,
especially in a way that generalises out of distribution. While recent years have seen a surge …

Cited by 71, 5 versions

[PDF] neurips.cc

Exploration via elliptical episodic bonuses

M Henaff, R Raileanu, M Jiang… - Advances in Neural …, 2022 - proceedings.neurips.cc

In recent years, a number of reinforcement learning (RL) methods have been proposed to
explore complex environments which differ across episodes. In this work, we show that the …

Cited by 44, 8 versions

[PDF] neurips.cc

Improving intrinsic exploration with language abstractions

J Mu, V Zhong, R Raileanu, M Jiang… - Advances in …, 2022 - proceedings.neurips.cc

Reinforcement learning (RL) agents are particularly hard to train when rewards are sparse.
One common solution is to use intrinsic rewards to encourage agents to explore their …

Cited by 67, 8 versions
