Learning generalized reactive policies using deep neural networks

E Groshev, M Goldstein, A Tamar… - Proceedings of the …, 2018 - ojs.aaai.org
We present a new approach to learning for planning, where knowledge acquired while
solving a given set of planning problems is used to plan faster in related, but new problem …

What can i do here? a theory of affordances in reinforcement learning

K Khetarpal, Z Ahmed, G Comanici… - International …, 2020 - proceedings.mlr.press
Reinforcement learning algorithms usually assume that all actions are always available to
an agent. However, both people and animals understand the general link between the …

Belief reward sha** in reinforcement learning

O Marom, B Rosman - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org
A key challenge in many reinforcement learning problems is delayed rewards, which can
significantly slow down learning. Although reward sha** has previously been introduced …

An empirical analysis of deep learning for cardinality estimation

J Ortiz, M Balazinska, J Gehrke, SS Keerthi - arxiv preprint arxiv …, 2019 - arxiv.org
We implement and evaluate deep learning for cardinality estimation by studying the
accuracy, space and time trade-offs across several architectures. We find that simple deep …

Agent-agnostic human-in-the-loop reinforcement learning

D Abel, J Salvatier, A Stuhlmüller, O Evans - arxiv preprint arxiv …, 2017 - arxiv.org
Providing Reinforcement Learning agents with expert advice can dramatically improve
various aspects of learning. Prior work has developed teaching protocols that enable agents …

Goal-based action priors

D Abel, D Hershkowitz, G Barth-Maron… - Proceedings of the …, 2015 - ojs.aaai.org
Robots that interact with people must flexibly respond to requests by planning in stochastic
state spaces that are often too large to solve for optimal behavior. In this work, we develop a …

Hierarchy through composition with multitask LMDPs

AM Saxe, AC Earle, B Rosman - … Conference on Machine …, 2017 - proceedings.mlr.press
Hierarchical architectures are critical to the scalability of reinforcement learning methods.
Most current hierarchical frameworks execute actions serially, with macro-actions …

Solving hard AI planning instances using curriculum-driven deep reinforcement learning

D Feng, CP Gomes, B Selman - arxiv preprint arxiv:2006.02689, 2020 - arxiv.org
Despite significant progress in general AI planning, certain domains remain out of reach of
current AI planning systems. Sokoban is a PSPACE-complete planning task and represents …

IRDA: Incremental reinforcement learning for dynamic resource allocation

J Wang, J Cao, S Wang, Z Yao… - IEEE Transactions on Big …, 2020 - ieeexplore.ieee.org
Resource allocation problems often manifest as online decision-making tasks where the
proper allocation strategy depends on the understanding of the allocation environment and …

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning

D Zhong, Y Yang, Q Zhao - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
The large action space is one fundamental obstacle to deploying Reinforcement Learning
methods in the real world. The numerous redundant actions will cause the agents to make …