- Academic Search

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Zapisz Cytuj Cytowane przez 340 Powiązane artykuły Wszystkie wersje 9 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z **ong, L Zintgraf… - arxiv preprint arxiv …, 2023 - arxiv.org

While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …

Zapisz Cytuj Cytowane przez 171 Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

JA Mendez, E Eaton - arxiv preprint arxiv:2207.07730, 2022 - arxiv.org

A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …

Zapisz Cytuj Cytowane przez 33 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

Bootstrap your own skills: Learning to solve new tasks with large language model guidance

J Zhang, J Zhang, K Pertsch, Z Liu, X Ren… - arxiv preprint arxiv …, 2023 - arxiv.org

We propose BOSS, an approach that automatically learns to solve new long-horizon,
complex, and meaningful tasks by growing a learned skill library with minimal supervision …

Zapisz Cytuj Cytowane przez 61 Powiązane artykuły Wszystkie wersje 7 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

Hyper-decision transformer for efficient online policy adaptation

M Xu, Y Lu, Y Shen, S Zhang, D Zhao… - arxiv preprint arxiv …, 2023 - arxiv.org

Decision Transformers (DT) have demonstrated strong performances in offline reinforcement
learning settings, but quickly adapting to unseen novel tasks remains challenging. To …

Zapisz Cytuj Cytowane przez 42 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]

[PDF] neurips.cc

Learning to discover skills through guidance

H Kim, BK Lee, H Lee, D Hwang… - Advances in …, 2024 - proceedings.neurips.cc

In the field of unsupervised skill discovery (USD), a major challenge is limited exploration,
primarily due to substantial penalties when skills deviate from their initial trajectories. To …

Zapisz Cytuj Cytowane przez 8 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] neurips.cc

Learning options via compression

Y Jiang, E Liu, B Eysenbach… - Advances in Neural …, 2022 - proceedings.neurips.cc

Identifying statistical regularities in solutions to some tasks in multi-task reinforcement
learning can accelerate the learning of new tasks. Skill learning offers one way of identifying …

Zapisz Cytuj Cytowane przez 21 Powiązane artykuły Wszystkie wersje 7 Wersja HTML

[Free GPT-4]

[PDF] academia.edu

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

A Mohan, A Zhang, M Lindauer - arxiv preprint arxiv:2306.16021, 2023 - academia.edu

Reinforcement Learning (RL), bolstered by the expressive capabilities of Deep Neural
Networks (DNNs) for function approximation, has demonstrated considerable success in …

Zapisz Cytuj Cytowane przez 17 Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

Sprint: Scalable policy pre-training via language instruction relabeling

J Zhang, K Pertsch, J Zhang… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

Pre-training robots with a rich set of skills can substantially accelerate the learning of
downstream tasks. Prior works have defined pre-training tasks via natural language …

Zapisz Cytuj Cytowane przez 13 Powiązane artykuły Wszystkie wersje 2

[Free GPT-4]

[PDF] aaai.org

Flow to control: Offline reinforcement learning with lossless primitive discovery

Y Yang, H Hu, W Li, S Li, J Yang, Q Zhao… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Offline reinforcement learning (RL) enables the agent to effectively learn from logged data,
which significantly extends the applicability of RL algorithms in real-world scenarios where …

Zapisz Cytuj Cytowane przez 14 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Skill-based meta-reinforcement learning

Towards continual reinforcement learning: A review and perspectives

A survey of meta-reinforcement learning

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

Bootstrap your own skills: Learning to solve new tasks with large language model guidance

Hyper-decision transformer for efficient online policy adaptation

Learning to discover skills through guidance

Learning options via compression

[PDF][PDF] Structure in reinforcement learning: A survey and open problems

Sprint: Scalable policy pre-training via language instruction relabeling

Flow to control: Offline reinforcement learning with lossless primitive discovery