Symbolic task inference in deep reinforcement learning
This paper proposes DeepSynth, a method for effective training of deep reinforcement
learning agents when the reward is sparse or non-Markovian, but at the same time progress …
learning agents when the reward is sparse or non-Markovian, but at the same time progress …
Reinforcement learning with predefined and inferred reward machines in stochastic games
This paper focuses on Multi-Agent Reinforcement Learning (MARL) in non-cooperative
stochastic games, particularly addressing the challenge of task completion characterized by …
stochastic games, particularly addressing the challenge of task completion characterized by …
Translating omega-regular specifications to average objectives for model-free reinforcement learning
Recent success in reinforcement learning (RL) has brought renewed attention to the design
of reward functions by which agent behavior is reinforced or deterred. Manually designing …
of reward functions by which agent behavior is reinforced or deterred. Manually designing …
Regular Reinforcement Learning
In reinforcement learning, an agent incrementally refines a behavioral policy through a
series of episodic interactions with its environment. This process can be characterized as …
series of episodic interactions with its environment. This process can be characterized as …
Hierarchies of reward machines
Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …
reinforcement learning task through a finite-state machine whose edges encode subgoals of …
Inferring probabilistic reward machines from non-markovian reward signals for reinforcement learning
The success of reinforcement learning in typical settings is predicated on Markovian
assumptions on the reward signal by which an agent learns optimal policies. In recent years …
assumptions on the reward signal by which an agent learns optimal policies. In recent years …
Learning task automata for reinforcement learning using hidden Markov models
Training reinforcement learning (RL) agents using scalar reward signals is often infeasible
when an environment has sparse and non-Markovian rewards. Moreover, handcrafting …
when an environment has sparse and non-Markovian rewards. Moreover, handcrafting …
Learning Environment Models with Continuous Stochastic Dynamics
Solving control tasks in complex environments automatically through learning offers great
potential. While contemporary techniques from deep reinforcement learning (DRL) provide …
potential. While contemporary techniques from deep reinforcement learning (DRL) provide …
Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines
X Zheng, C Yu - arxiv preprint arxiv:2403.07005, 2024 - arxiv.org
In this paper, we study the cooperative Multi-Agent Reinforcement Learning (MARL)
problems using Reward Machines (RMs) to specify the reward functions such that the prior …
problems using Reward Machines (RMs) to specify the reward functions such that the prior …
Reinforcement learning under partial observability guided by learned environment models
Reinforcement learning and planning under partial observability is notoriously difficult. In
this setting, decision-making agents need to perform a sequence of actions with incomplete …
this setting, decision-making agents need to perform a sequence of actions with incomplete …