- Academic Search

E Groshev, M Goldstein, A Tamar… - Proceedings of the …, 2018 - ojs.aaai.org

We present a new approach to learning for planning, where knowledge acquired while
solving a given set of planning problems is used to plan faster in related, but new problem …

Enregistrer Citer Cité 141 fois Autres articles Les 15 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] mlr.press

What can i do here? a theory of affordances in reinforcement learning

K Khetarpal, Z Ahmed, G Comanici… - International …, 2020 - proceedings.mlr.press

Reinforcement learning algorithms usually assume that all actions are always available to
an agent. However, both people and animals understand the general link between the …

Enregistrer Citer Cité 78 fois Autres articles Les 6 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] aaai.org

Belief reward sha** in reinforcement learning

O Marom, B Rosman - Proceedings of the AAAI conference on artificial …, 2018 - ojs.aaai.org

A key challenge in many reinforcement learning problems is delayed rewards, which can
significantly slow down learning. Although reward sha** has previously been introduced …

Enregistrer Citer Cité 90 fois Autres articles Les 11 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

An empirical analysis of deep learning for cardinality estimation

J Ortiz, M Balazinska, J Gehrke, SS Keerthi - arxiv preprint arxiv …, 2019 - arxiv.org

We implement and evaluate deep learning for cardinality estimation by studying the
accuracy, space and time trade-offs across several architectures. We find that simple deep …

Enregistrer Citer Cité 73 fois Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Agent-agnostic human-in-the-loop reinforcement learning

D Abel, J Salvatier, A Stuhlmüller, O Evans - arxiv preprint arxiv …, 2017 - arxiv.org

Providing Reinforcement Learning agents with expert advice can dramatically improve
various aspects of learning. Prior work has developed teaching protocols that enable agents …

Enregistrer Citer Cité 86 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] aaai.org

Goal-based action priors

D Abel, D Hershkowitz, G Barth-Maron… - Proceedings of the …, 2015 - ojs.aaai.org

Robots that interact with people must flexibly respond to requests by planning in stochastic
state spaces that are often too large to solve for optimal behavior. In this work, we develop a …

Enregistrer Citer Cité 59 fois Autres articles Les 13 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] mlr.press

Hierarchy through composition with multitask LMDPs

AM Saxe, AC Earle, B Rosman - … Conference on Machine …, 2017 - proceedings.mlr.press

Hierarchical architectures are critical to the scalability of reinforcement learning methods.
Most current hierarchical frameworks execute actions serially, with macro-actions …

Enregistrer Citer Cité 46 fois Autres articles Les 10 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Solving hard AI planning instances using curriculum-driven deep reinforcement learning

D Feng, CP Gomes, B Selman - arxiv preprint arxiv:2006.02689, 2020 - arxiv.org

Despite significant progress in general AI planning, certain domains remain out of reach of
current AI planning systems. Sokoban is a PSPACE-complete planning task and represents …

Enregistrer Citer Cité 31 fois Autres articles Les 8 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] google.com

IRDA: Incremental reinforcement learning for dynamic resource allocation

J Wang, J Cao, S Wang, Z Yao… - IEEE Transactions on Big …, 2020 - ieeexplore.ieee.org

Resource allocation problems often manifest as online decision-making tasks where the
proper allocation strategy depends on the understanding of the allocation environment and …

Enregistrer Citer Cité 31 fois Autres articles Les 5 versions Free GPT-4

[Free GPT-4]

[PDF] aaai.org

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning

D Zhong, Y Yang, Q Zhao - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

The large action space is one fundamental obstacle to deploying Reinforcement Learning
methods in the real world. The numerous redundant actions will cause the agents to make …

Enregistrer Citer Cité 6 fois Autres articles Les 3 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

What good are actions? Accelerating learning using learned action priors

Learning generalized reactive policies using deep neural networks

What can i do here? a theory of affordances in reinforcement learning

Belief reward sha** in reinforcement learning

An empirical analysis of deep learning for cardinality estimation

Agent-agnostic human-in-the-loop reinforcement learning

Goal-based action priors

Hierarchy through composition with multitask LMDPs

Solving hard AI planning instances using curriculum-driven deep reinforcement learning

IRDA: Incremental reinforcement learning for dynamic resource allocation

No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning