Hierarchical reinforcement learning: A comprehensive survey
Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …
challenging long-horizon decision-making tasks into simpler subtasks. During the past …
Hierarchical reinforcement learning: A survey and open research challenges
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …
by interacting with an environment in a trial-and-error fashion. When these environments are …
Model-based reinforcement learning: A survey
Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …
optimization, is an important challenge in artificial intelligence. Two key approaches to this …
Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation
Learning goal-directed behavior in environments with sparse feedback is a major challenge
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …
The option-critic architecture
Temporal abstraction is key to scaling up learning and planning in reinforcement learning.
While planning with temporally extended actions is well understood, creating such …
While planning with temporally extended actions is well understood, creating such …
Skill induction and planning with latent language
We present a framework for learning hierarchical policies from demonstrations, using sparse
natural language annotations to guide the discovery of reusable skills for autonomous …
natural language annotations to guide the discovery of reusable skills for autonomous …
Learning multi-level hierarchies with hindsight
Hierarchical agents have the potential to solve sequential decision making tasks with
greater sample efficiency than their non-hierarchical counterparts because hierarchical …
greater sample efficiency than their non-hierarchical counterparts because hierarchical …
Learning neuro-symbolic skills for bilevel planning
Decision-making is challenging in robotics environments with continuous object-centric
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
states, continuous actions, long horizons, and sparse feedback. Hierarchical approaches …
A laplacian framework for option discovery in reinforcement learning
Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …
Autotelic agents with intrinsically motivated goal-conditioned reinforcement learning: a short survey
Building autonomous machines that can explore open-ended environments, discover
possible interactions and build repertoires of skills is a general objective of artificial …
possible interactions and build repertoires of skills is a general objective of artificial …