- Academic Search

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org

Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

Enregistrer Citer Cité 442 fois Autres articles Les 5 versions Free GPT-4

[Free GPT-4]

[PDF] mdpi.com

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com

Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Enregistrer Citer Cité 112 fois Autres articles Les 8 versions Free GPT-4 En cache

[Free GPT-4]

[PDF] nowpublishers.com

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com

Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

Enregistrer Citer Cité 926 fois Autres articles Les 17 versions Free GPT-4 Recherche dans les bibliothèques Version HTML

[Free GPT-4]

[PDF] neurips.cc

Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation

TD Kulkarni, K Narasimhan, A Saeedi… - Advances in neural …, 2016 - proceedings.neurips.cc

Learning goal-directed behavior in environments with sparse feedback is a major challenge
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …

Enregistrer Citer Cité 1506 fois Autres articles Les 9 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] aaai.org

The option-critic architecture

PL Bacon, J Harb, D Precup - Proceedings of the AAAI conference on …, 2017 - ojs.aaai.org

Temporal abstraction is key to scaling up learning and planning in reinforcement learning.
While planning with temporally extended actions is well understood, creating such …

Enregistrer Citer Cité 1356 fois Autres articles Les 14 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Skill induction and planning with latent language

P Sharma, A Torralba, J Andreas - arxiv preprint arxiv:2110.01517, 2021 - arxiv.org

We present a framework for learning hierarchical policies from demonstrations, using sparse
natural language annotations to guide the discovery of reusable skills for autonomous …

Enregistrer Citer Cité 117 fois Autres articles Les 4 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Learning multi-level hierarchies with hindsight

A Levy, G Konidaris, R Platt, K Saenko - arxiv preprint arxiv:1712.00948, 2017 - arxiv.org

Hierarchical agents have the potential to solve sequential decision making tasks with
greater sample efficiency than their non-hierarchical counterparts because hierarchical …

Enregistrer Citer Cité 351 fois Autres articles Les 11 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Learning neuro-symbolic skills for bilevel planning

T Silver, A Athalye, JB Tenenbaum… - arxiv preprint arxiv …, 2022 - arxiv.org

Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …

Enregistrer Citer Cité 82 fois Autres articles Les 5 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] mlr.press

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press

Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

Enregistrer Citer Cité 327 fois Autres articles Les 9 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] jair.org Full View

Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey

C Colas, T Karch, O Sigaud, PY Oudeyer - Journal of Artificial Intelligence …, 2022 - jair.org

Building autonomous machines that can explore open-ended environments, discover
possible interactions and build repertoires of skills is a general objective of artificial …

Enregistrer Citer Cité 136 fois Autres articles Les 25 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

Automatic discovery of subgoals in reinforcement learning using diverse density

Hierarchical reinforcement learning: A comprehensive survey

Hierarchical reinforcement learning: A survey and open research challenges

Model-based reinforcement learning: A survey

Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation

The option-critic architecture

Skill induction and planning with latent language

Learning multi-level hierarchies with hindsight

Learning neuro-symbolic skills for bilevel planning

A laplacian framework for option discovery in reinforcement learning

Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey