Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org
We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

Omega-regular objectives in model-free reinforcement learning

EM Hahn, M Perez, S Schewe, F Somenzi… - … conference on tools and …, 2019 - Springer
We provide the first solution for model-free reinforcement learning of ω-regular objectives for
Markov decision processes (MDPs). We present a constructive reduction from the almost …

Logically-constrained reinforcement learning

M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv:1801.08099, 2018 - arxiv.org
We present the first model-free Reinforcement Learning (RL) algorithm to synthesise
policies for an unknown Markov Decision Process (MDP), such that a linear time property is …

Deep reinforcement learning with temporal logics

M Hasanbeig, D Kroening, A Abate - … and Analysis of Timed Systems: 18th …, 2020 - Springer
The combination of data-driven learning methods with formal reasoning has seen a surge of
interest, as either area has the potential to bolstering the other. For instance, formal methods …

Owl: A library for-words, automata, and LTL

J Křetínský, T Meggendorfer, S Sickert - International Symposium on …, 2018 - Springer
We present the library Owl (O mega-W ords, automata, and L TL) for ω-automata and linear
temporal logic. It forms a backbone of several translations from LTL to automata and related …

Semantic traffic law adaptive decision-making for self-driving vehicles

J Liu, H Wang, Z Cao, W Yu, C Zhao… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Facts proved that obeying traffic laws keeps the promise to promote the safety of self-driving
vehicles. Current self-driving vehicles usually have fixed algorithms during autonomous …

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier
Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

Policy synthesis and reinforcement learning for discounted ltl

R Alur, O Bastani, K Jothimurugan, M Perez… - … on Computer Aided …, 2023 - Springer
The difficulty of manually specifying reward functions has led to an interest in using linear
temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL …

Modular deep reinforcement learning with temporal logic specifications

LZ Yuan, M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv …, 2019 - arxiv.org
We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework
for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward …

LCRL: Certified policy synthesis via logically-constrained reinforcement learning

M Hasanbeig, D Kroening, A Abate - International Conference on …, 2022 - Springer
LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms
over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given …