Reward machines: Exploiting reward function structure in reinforcement learning
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …
such, these methods must extensively interact with the environment in order to discover …
Control barrier functions for signal temporal logic tasks
L Lindemann, DV Dimarogonas - IEEE control systems letters, 2018 - ieeexplore.ieee.org
The need for computationally-efficient control methods of dynamical systems under temporal
logic tasks has recently become more apparent. Existing methods are computationally …
logic tasks has recently become more apparent. Existing methods are computationally …
[BOK][B] Formal methods for discrete-time dynamical systems
In control theory, complex models of physical processes, such as systems of differential or
difference equations, are usually checked against simple specifications, such as stability …
difference equations, are usually checked against simple specifications, such as stability …
Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges
In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …
systems. By employing mathematically rigorous techniques, formal methods can provide …
A survey on interpretable reinforcement learning
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …
for sequential decision-making problems, it is still not mature enough for high-stake domains …
Compositional reinforcement learning from logical specifications
K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …
specifications. Recent approaches automatically generate a reward function from a given …
Reinforcement learning with temporal logic rewards
Reinforcement learning (RL) depends critically on the choice of reward functions used to
capture the desired behavior and constraints of a robot. Usually, these are handcrafted by a …
capture the desired behavior and constraints of a robot. Usually, these are handcrafted by a …
Control synthesis from linear temporal logic specifications using model-free reinforcement learning
We present a reinforcement learning (RL) frame-work to synthesize a control policy from a
given linear temporal logic (LTL) specification in an unknown stochastic environment that …
given linear temporal logic (LTL) specification in an unknown stochastic environment that …
Modular deep reinforcement learning for continuous motion planning with temporal logic
This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …
Markov decision processes (MDP) with unknown transition probabilities over continuous …
Ltl2action: Generalizing ltl instructions for multi-task rl
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …
instructions in multi-task environments. Instructions are expressed in a well-known formal …