- Academic Search

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Cited by 351

[PDF] arxiv.org

A social path to human-like artificial intelligence

EA Duéñez-Guzmán, S Sadedin, JX Wang… - Nature machine …, 2023 - nature.com

Traditionally, cognitive and computer scientists have viewed intelligence solipsistically, as a
property of unitary agents devoid of social context. Given the success of contemporary …

Cited by 33

[PDF] mlr.press

Scaling up and distilling down: Language-guided robot skill acquisition

H Ha, P Florence, S Song - Conference on Robot Learning, 2023 - proceedings.mlr.press

We present a framework for robot skill acquisition, which 1) efficiently scale up data
generation of language-labelled robot data and 2) effectively distills this data down into a …

Cited by 128

[PDF] science.org

Learning quadrupedal locomotion over challenging terrain

J Lee, J Hwangbo, L Wellhausen, V Koltun, M Hutter - Science robotics, 2020 - science.org

Legged locomotion can extend the operational domain of robots to some of the most
challenging environments on Earth. However, conventional controllers for legged …

Cited by 1248

[PDF] jair.org

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

Cited by 221

[PDF] openreview.net

Emergent tool use from multi-agent autocurricula

B Baker, I Kanitscheider, T Markov, Y Wu… - International …, 2019 - openreview.net

Through multi-agent competition, the simple objective of hide-and-seek, and standard
reinforcement learning algorithms at scale, we find that agents create a self-supervised …

Cited by 915

[PDF] mlr.press

Evolving curricula with regret-based environment design

J Parker-Holder, M Jiang, M Dennis… - International …, 2022 - proceedings.mlr.press

Training generally-capable agents with reinforcement learning (RL) remains a significant
challenge. A promising avenue for improving the robustness of RL agents is through the use …

Cited by 132

[PDF] arxiv.org

Unity: A general platform for intelligent agents

A Juliani, VP Berges, E Teng, A Cohen… - arxiv preprint arxiv …, 2018 - arxiv.org

Recent advances in artificial intelligence have been driven by the presence of increasingly
realistic and complex simulated environments. However, many of the existing environments …

Cited by 1261

[PDF] arxiv.org

Human-timescale adaptation in an open-ended task space

AA Team, J Bauer, K Baumli, S Baveja… - arxiv preprint arxiv …, 2023 - arxiv.org

Foundation models have shown impressive adaptation and scalability in supervised and self-
supervised learning problems, but so far these successes have not fully translated to …

Cited by 89

[PDF] arxiv.org

Gensim: Generating robotic simulation tasks via large language models

L Wang, Y Ling, Z Yuan, M Shridhar, C Bao… - arxiv preprint arxiv …, 2023 - arxiv.org

Collecting large amounts of real-world interaction data to train general robotic policies is
often prohibitively expensive, thus motivating the use of simulation data. However, existing …

Cited by 66

Paired open-ended trailblazer (poet): Endlessly generating increasingly complex and diverse...

Towards continual reinforcement learning: A review and perspectives
