- Academic Search

Active finite reward automaton inference and reinforcement learning using queries and counterexam...

Symbolic task inference in deep reinforcement learning

H Hasanbeig, NY Jeppu, A Abate, T Melham… - Journal of Artificial …, 2024 - jair.org

This paper proposes DeepSynth, a method for effective training of deep reinforcement
learning agents when the reward is sparse or non-Markovian, but at the same time progress …

Cited by 4. All 3 versions.

Reinforcement learning with predefined and inferred reward machines in stochastic games

J Hu, Y Paliwal, H Kim, Y Wang, Z Xu - Neurocomputing, 2024 - Elsevier

This paper focuses on Multi-Agent Reinforcement Learning (MARL) in non-cooperative
stochastic games, particularly addressing the challenge of task completion characterized by …

Cited by 2. All 3 versions.

[PDF] nsf.gov

Translating omega-regular specifications to average objectives for model-free reinforcement learning

M Kazemi, M Perez, F Somenzi, S Soudjani… - Proc. of the 21st …, 2022 - par.nsf.gov

Recent success in reinforcement learning (RL) has brought renewed attention to the design
of reward functions by which agent behavior is reinforced or deterred. Manually designing …

Cited by 16. All 6 versions.

[PDF] springer.com

Regular Reinforcement Learning

T Dohmen, M Perez, F Somenzi, A Trivedi - International Conference on …, 2024 - Springer

In reinforcement learning, an agent incrementally refines a behavioral policy through a
series of episodic interactions with its environment. This process can be characterized as …

Cited by 2. All 2 versions.

[PDF] mlr.press

Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press

Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

Cited by 16. All 9 versions.

[PDF] aaai.org

Inferring probabilistic reward machines from non-Markovian reward signals for reinforcement learning

T Dohmen, N Topper, G Atia, A Beckus… - Proceedings of the …, 2022 - ojs.aaai.org

The success of reinforcement learning in typical settings is predicated on Markovian
assumptions on the reward signal by which an agent learns optimal policies. In recent years …

Cited by 22. All 4 versions.

[PDF] iospress.nl

Learning task automata for reinforcement learning using hidden Markov models

A Abate, Y Almulla, J Fox, D Hyland, M Wooldridge - ECAI 2023, 2023 - ebooks.iospress.nl

Training reinforcement learning (RL) agents using scalar reward signals is often infeasible
when an environment has sparse and non-Markovian rewards. Moreover, handcrafting …

Cited by 7. All 4 versions.

[PDF] arxiv.org

Learning Environment Models with Continuous Stochastic Dynamics

M Tappler, E Muškardin, BK Aichernig… - arXiv preprint arXiv …, 2023 - arxiv.org

Solving control tasks in complex environments automatically through learning offers great
potential. While contemporary techniques from deep reinforcement learning (DRL) provide …

Cited by 2. All 3 versions.

[PDF] arxiv.org

Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines

X Zheng, C Yu - arXiv preprint arXiv:2403.07005, 2024 - arxiv.org

In this paper, we study the cooperative Multi-Agent Reinforcement Learning (MARL)
problems using Reward Machines (RMs) to specify the reward functions such that the prior …

Cited by 2. All 2 versions.

[PDF] arxiv.org

Reinforcement learning under partial observability guided by learned environment models

E Muškardin, M Tappler, BK Aichernig, I Pill - International Conference on …, 2023 - Springer

Reinforcement learning and planning under partial observability is notoriously difficult. In
this setting, decision-making agents need to perform a sequence of actions with incomplete …

Cited by 9. All 11 versions.

