- Academic Search

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org

We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

Uložit Citovat Počet citací tohoto článku: 154 Související články Všechny verze (počet: 12)

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Omega-regular objectives in model-free reinforcement learning

EM Hahn, M Perez, S Schewe, F Somenzi… - … conference on tools and …, 2019 - Springer

We provide the first solution for model-free reinforcement learning of ω-regular objectives for
Markov decision processes (MDPs). We present a constructive reduction from the almost …

Uložit Citovat Počet citací tohoto článku: 177 Související články Všechny verze (počet: 18)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Logically-constrained reinforcement learning

M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv:1801.08099, 2018 - arxiv.org

We present the first model-free Reinforcement Learning (RL) algorithm to synthesise
policies for an unknown Markov Decision Process (MDP), such that a linear time property is …

Uložit Citovat Počet citací tohoto článku: 128 Související články Všechny verze (počet: 6) Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] ox.ac.uk

Deep reinforcement learning with temporal logics

M Hasanbeig, D Kroening, A Abate - … and Analysis of Timed Systems: 18th …, 2020 - Springer

The combination of data-driven learning methods with formal reasoning has seen a surge of
interest, as either area has the potential to bolstering the other. For instance, formal methods …

Uložit Citovat Počet citací tohoto článku: 78 Související články Všechny verze (počet: 6)

[Free GPT-4]
[DeepSeek]

[PDF] tum.de

Owl: A library for-words, automata, and LTL

J Křetínský, T Meggendorfer, S Sickert - International Symposium on …, 2018 - Springer

We present the library Owl (O mega-W ords, automata, and L TL) for ω-automata and linear
temporal logic. It forms a backbone of several translations from LTL to automata and related …

Uložit Citovat Počet citací tohoto článku: 103 Související články Všechny verze (počet: 3)

Semantic traffic law adaptive decision-making for self-driving vehicles

J Liu, H Wang, Z Cao, W Yu, C Zhao… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

Facts proved that obeying traffic laws keeps the promise to promote the safety of self-driving
vehicles. Current self-driving vehicles usually have fixed algorithms during autonomous …

Uložit Citovat Počet citací tohoto článku: 12 Související články Všechny verze (počet: 4)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier

Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

Uložit Citovat Počet citací tohoto článku: 70 Související články Všechny verze (počet: 12)

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Policy synthesis and reinforcement learning for discounted ltl

R Alur, O Bastani, K Jothimurugan, M Perez… - … on Computer Aided …, 2023 - Springer

The difficulty of manually specifying reward functions has led to an interest in using linear
temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL …

Uložit Citovat Počet citací tohoto článku: 14 Související články Všechny verze (počet: 8)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Modular deep reinforcement learning with temporal logic specifications

LZ Yuan, M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv …, 2019 - arxiv.org

We propose an actor-critic, model-free, and online Reinforcement Learning (RL) framework
for continuous-state continuous-action Markov Decision Processes (MDPs) when the reward …

Uložit Citovat Počet citací tohoto článku: 48 Související články Zobrazit jako HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

LCRL: Certified policy synthesis via logically-constrained reinforcement learning

M Hasanbeig, D Kroening, A Abate - International Conference on …, 2022 - Springer

LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms
over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given …

Uložit Citovat Počet citací tohoto článku: 19 Související články Všechny verze (počet: 6)

Vytvořit upozornění

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

MoChiBA: Probabilistic LTL model checking using limit-deterministic Büchi automata

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

Omega-regular objectives in model-free reinforcement learning

Logically-constrained reinforcement learning

Deep reinforcement learning with temporal logics

Owl: A library for-words, automata, and LTL

Semantic traffic law adaptive decision-making for self-driving vehicles

Certified reinforcement learning with logic guidance

Policy synthesis and reinforcement learning for discounted ltl

Modular deep reinforcement learning with temporal logic specifications

LCRL: Certified policy synthesis via logically-constrained reinforcement learning