Value preserving state-action abstractions

D Abel, N Umbanhowar, K Khetarpal… - International …, 2020 - proceedings.mlr.press
Abstraction can improve the sample efficiency of reinforcement learning. However, the
process of abstraction inherently discards information, potentially compromising an agent's …

Learning markov state abstractions for deep reinforcement learning

C Allen, N Parikh, O Gottesman… - Advances in Neural …, 2021 - proceedings.neurips.cc
A fundamental assumption of reinforcement learning in Markov decision processes (MDPs)
is that the relevant decision process is, in fact, Markov. However, when MDPs have rich …

Decentralized cooperative planning for automated vehicles with hierarchical monte carlo tree search

K Kurzer, C Zhou, JM Zöllner - 2018 IEEE intelligent vehicles …, 2018 - ieeexplore.ieee.org
Today's automated vehicles lack the ability to cooperate implicitly with others. This work
presents a Monte Carlo Tree Search (MCTS) based approach for decentralized cooperative …

Anytime integrated task and motion policies for stochastic environments

N Shah, DK Vasudevan, K Kumar… - … on Robotics and …, 2020 - ieeexplore.ieee.org
In order to solve complex, long-horizon tasks, intelligent robots need to carry out high-level,
abstract planning and reasoning in conjunction with motion planning. However, abstract …

Learning compositional neural programs with recursive tree search and planning

T Pierrot, G Ligner, SE Reed… - Advances in …, 2019 - proceedings.neurips.cc
We propose a novel reinforcement learning algorithm, AlphaNPI, that incorpo-rates the
strengths of Neural Programmer-Interpreters (NPI) and AlphaZero. NPI contributes structural …

Monte Carlo tree search with spectral expansion for planning with dynamical systems

B Rivière, J Lathrop, SJ Chung - Science Robotics, 2024 - science.org
The ability of a robot to plan complex behaviors with real-time computation, rather than
adhering to predesigned or offline-learned routines, alleviates the need for specialized …

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces

Z Huang, Q Liu, F Zhu, L Zhang, L Wu - Expert Systems with Applications, 2024 - Elsevier
The fundamental concept behind option-based hierarchical reinforcement learning (O-HRL)
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …

Conditional abstraction trees for sample-efficient reinforcement learning

M Dadvar, RK Nayyar… - Uncertainty in Artificial …, 2023 - proceedings.mlr.press
In many real-world problems, the learning agent needs to learn a problem's abstractions and
solution simultaneously. However, most such abstractions need to be designed and refined …

Accelerating monte carlo tree search with probability tree state abstraction

Y Fu, M Sun, B Nie, Y Gao - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Monte Carlo Tree Search (MCTS) algorithms such as AlphaGo and MuZero have
achieved superhuman performance in many challenging tasks. However, the computational …

Multi-resolution POMDP planning for multi-object search in 3D

K Zheng, Y Sung, G Konidaris… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
Robots operating in households must find objects on shelves, under tables, and in
cupboards. In such environments, it is crucial to search efficiently at 3D scale while co** …