Hierarchical reinforcement learning: A comprehensive survey

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org
Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Augmenting reinforcement learning with behavior primitives for diverse manipulation tasks

S Nasiriany, H Liu, Y Zhu - 2022 International Conference on …, 2022 - ieeexplore.ieee.org
Realistic manipulation tasks require a robot to interact with an environment with a prolonged
sequence of motor actions. While deep reinforcement learning methods have recently …

Benchmarking reinforcement learning algorithms on real-world robots

AR Mahmood, D Korenkevych… - … on robot learning, 2018 - proceedings.mlr.press
Through many recent successes in simulation, model-free reinforcement learning has
emerged as a promising approach to solving continuous control robotic tasks. The research …

Spatial memory-augmented visual navigation based on hierarchical deep reinforcement learning in unknown environments

S **, X Wang, Q Meng - Knowledge-Based Systems, 2024 - Elsevier
Visual navigation in unknown environments poses significant challenges due to the
presence of many obstacles and low-texture scenes. These factors may cause frequent …

Option discovery using deep skill chaining

A Bagaria, G Konidaris - International Conference on Learning …, 2019 - openreview.net
Autonomously discovering temporally extended actions, or skills, is a longstanding goal of
hierarchical reinforcement learning. We propose a new algorithm that combines skill …

Learning actionable representations with goal-conditioned policies

D Ghosh, A Gupta, S Levine - arxiv preprint arxiv:1811.07819, 2018 - arxiv.org
Representation learning is a central challenge across a range of machine learning areas. In
reinforcement learning, effective and functional representations have the potential to …

DAC: The double actor-critic architecture for learning options

S Zhang, S Whiteson - Advances in Neural Information …, 2019 - proceedings.neurips.cc
We reformulate the option framework as two parallel augmented MDPs. Under this novel
formulation, all policy optimization algorithms can be used off the shelf to learn intra-option …

Learning abstract options

M Riemer, M Liu, G Tesauro - Advances in neural …, 2018 - proceedings.neurips.cc
Building systems that autonomously create temporal abstractions from data is a key
challenge in scaling learning and planning in reinforcement learning. One popular approach …

Value function spaces: Skill-centric state abstractions for long-horizon reasoning

D Shah, P Xu, Y Lu, T **ao, A Toshev, S Levine… - arxiv preprint arxiv …, 2021 - arxiv.org
Reinforcement learning can train policies that effectively perform complex tasks. However for
long-horizon tasks, the performance of these methods degrades with horizon, often …