Safe reinforcement learning via probabilistic shields

N Jansen, B Könighofer, S Junges, AC Serban… - arxiv preprint arxiv …, 2018 - arxiv.org
This paper targets the efficient construction of a safety shield for decision making in
scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent …

A composable specification language for reinforcement learning tasks

K Jothimurugan, R Alur… - Advances in Neural …, 2019 - proceedings.neurips.cc
Reinforcement learning is a promising approach for learning control policies for robot tasks.
However, specifying complex tasks (eg, with multiple objectives and safety constraints) can …

The mu-calculus and Model Checking

J Bradfield, I Walukiewicz - Handbook of Model Checking, 2018 - Springer
This chapter presents that part of the theory of the μ μ-calculus that is relevant to the model-
checking problem as broadly understood. The μ μ-calculus is one of the most important …

Practical synthesis of reactive systems from LTL specifications via parity games: You can teach an old dog new tricks: making a classic approach structured, forward …

M Luttenberger, PJ Meyer, S Sickert - Acta Informatica, 2020 - Springer
The synthesis of reactive systems from linear temporal logic (LTL) specifications is an
important aspect in the design of reliable software and hardware. We present our adaption …

Decentralized control synthesis for air traffic management in urban air mobility

S Bharadwaj, S Carr, N Neogi… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Urban air mobility (UAM) refers to air transportation services within an urban area, often in
an on-demand fashion. We study air traffic management (ATM) for vehicles in a UAM fleet …

Solving infinite-state games via acceleration

P Heim, R Dimitrova - Proceedings of the ACM on Programming …, 2024 - dl.acm.org
Two-player graph games have found numerous applications, most notably in the synthesis
of reactive systems from temporal specifications, but also in verification. The relevance of …

Run-time optimization for learned controllers through quantitative games

G Avni, R Bloem, K Chatterjee, TA Henzinger… - … Aided Verification: 31st …, 2019 - Springer
A controller is a device that interacts with a plant. At each time point, it reads the plant's state
and issues commands with the goal that the plant operates optimally. Constructing optimal …

Temporal logic and fair discrete systems

N Piterman, A Pnueli - Handbook of Model Checking, 2018 - Springer
Temporal logic has been used by philosophers to reason about the way the world changes
over time. Its modern use in specification and verification of systems describes the evolution …

[PDF][PDF] Reactive synthesis modulo theories using abstraction refinement

B Maderbacher, R Bloem - # …, 2022 - library.oapen.org
Reactive synthesis builds a system from a specification given as a temporal logic formula.
Traditionally, reactive synthesis is defined for systems with Boolean input and output …

Integrated resource allocation and strategy synthesis in safety games on graphs with deception

AN Kulkarni, MS Cohen, CA Kamhoua, J Fu - arxiv preprint arxiv …, 2024 - arxiv.org
Deception plays a crucial role in strategic interactions with incomplete information. Motivated
by security applications, we study a class of two-player turn-based deterministic games with …