Google Наука

D Abel, N Umbanhowar, K Khetarpal… - International …, 2020 - proceedings.mlr.press

Abstraction can improve the sample efficiency of reinforcement learning. However, the
process of abstraction inherently discards information, potentially compromising an agent's …

Запазване Позоваване С позовавания в 68 Сродни статии Всички 7 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning markov state abstractions for deep reinforcement learning

C Allen, N Parikh, O Gottesman… - Advances in Neural …, 2021 - proceedings.neurips.cc

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs)
is that the relevant decision process is, in fact, Markov. However, when MDPs have rich …

Запазване Позоваване С позовавания в 49 Сродни статии Всички 17 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Decentralized cooperative planning for automated vehicles with hierarchical monte carlo tree search

K Kurzer, C Zhou, JM Zöllner - 2018 IEEE intelligent vehicles …, 2018 - ieeexplore.ieee.org

Today's automated vehicles lack the ability to cooperate implicitly with others. This work
presents a Monte Carlo Tree Search (MCTS) based approach for decentralized cooperative …

Запазване Позоваване С позовавания в 63 Сродни статии Всички 6 версии

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Anytime integrated task and motion policies for stochastic environments

N Shah, DK Vasudevan, K Kumar… - … on Robotics and …, 2020 - ieeexplore.ieee.org

In order to solve complex, long-horizon tasks, intelligent robots need to carry out high-level,
abstract planning and reasoning in conjunction with motion planning. However, abstract …

Запазване Позоваване С позовавания в 43 Сродни статии Всички 8 версии

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning compositional neural programs with recursive tree search and planning

T Pierrot, G Ligner, SE Reed… - Advances in …, 2019 - proceedings.neurips.cc

We propose a novel reinforcement learning algorithm, AlphaNPI, that incorpo-rates the
strengths of Neural Programmer-Interpreters (NPI) and AlphaZero. NPI contributes structural …

Запазване Позоваване С позовавания в 46 Сродни статии Всички 9 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] science.org

Monte Carlo tree search with spectral expansion for planning with dynamical systems

B Rivière, J Lathrop, SJ Chung - Science Robotics, 2024 - science.org

The ability of a robot to plan complex behaviors with real-time computation, rather than
adhering to predesigned or offline-learned routines, alleviates the need for specialized …

Запазване Позоваване Сродни статии Всички 5 версии

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces

Z Huang, Q Liu, F Zhu, L Zhang, L Wu - Expert Systems with Applications, 2024 - Elsevier

The fundamental concept behind option-based hierarchical reinforcement learning (O-HRL)
is to obtain temporal coarse-grained actions and abstract complex situations. Although O …

Запазване Позоваване С позовавания в 5 Сродни статии Всички 2 версии

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Conditional abstraction trees for sample-efficient reinforcement learning

M Dadvar, RK Nayyar… - Uncertainty in Artificial …, 2023 - proceedings.mlr.press

In many real-world problems, the learning agent needs to learn a problem's abstractions and
solution simultaneously. However, most such abstractions need to be designed and refined …

Запазване Позоваване С позовавания в 7 Сродни статии Всички 11 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Accelerating monte carlo tree search with probability tree state abstraction

Y Fu, M Sun, B Nie, Y Gao - Advances in Neural …, 2023 - proceedings.neurips.cc

Abstract Monte Carlo Tree Search (MCTS) algorithms such as AlphaGo and MuZero have
achieved superhuman performance in many challenging tasks. However, the computational …

Запазване Позоваване С позовавания в 4 Сродни статии Всички 6 версии Във вид на HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-resolution POMDP planning for multi-object search in 3D

K Zheng, Y Sung, G Konidaris… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org

Robots operating in households must find objects on shelves, under tables, and in
cupboards. In such environments, it is crucial to search efficiently at 3D scale while co** …

Запазване Позоваване С позовавания в 26 Сродни статии Всички 13 версии

Създаване на сигнал

Позоваване

Разширено търсене

Запазено в „Моята библиотека“

Markovian State and Action Abstractions for MDPs via Hierarchical MCTS.

Value preserving state-action abstractions

Learning markov state abstractions for deep reinforcement learning

Decentralized cooperative planning for automated vehicles with hierarchical monte carlo tree search

Anytime integrated task and motion policies for stochastic environments

Learning compositional neural programs with recursive tree search and planning

Monte Carlo tree search with spectral expansion for planning with dynamical systems

Hierarchical reinforcement learning with unlimited option scheduling for sparse rewards in continuous spaces

Conditional abstraction trees for sample-efficient reinforcement learning

Accelerating monte carlo tree search with probability tree state abstraction

Multi-resolution POMDP planning for multi-object search in 3D