Google znalac

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Spremi Citiraj Spominje se 301 puta Srodni članci Svih 3 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer

Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

Spremi Citiraj Spominje se 85 puta Srodni članci Svih 9 inačica

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems

HD Tran, X Yang, D Manzanas Lopez, P Musau… - … conference on computer …, 2020 - Springer

This paper presents the Neural Network Verification (NNV) software tool, a set-based
verification framework for deep neural networks (DNNs) and learning-enabled cyber …

Spremi Citiraj Spominje se 323 puta Srodni članci Svih 18 inačica

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Safe-state enhancement method for autonomous driving via direct hierarchical reinforcement learning

Z Gu, L Gao, H Ma, SE Li, S Zheng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Reinforcement learning (RL) has shown excellent performance in the sequential decision-
making problem, where safety in the form of state constraints is of great significance in the …

Spremi Citiraj Spominje se 36 puta Srodni članci Svih 5 inačica

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

M Hasanbeig, Y Kantaros, A Abate… - 2019 IEEE 58th …, 2019 - ieeexplore.ieee.org

We present a model-free reinforcement learning algorithm to synthesize control policies that
maximize the probability of satisfying high-level control objectives given as Linear Temporal …

Spremi Citiraj Spominje se 154 puta Srodni članci Svih 12 inačica

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Neurosymbolic reinforcement learning with formally verified exploration

G Anderson, A Verma, I Dillig… - Advances in neural …, 2020 - proceedings.neurips.cc

We present REVEL, a partially neural reinforcement learning (RL) framework for provably
safe exploration in continuous state and action spaces. A key challenge for provably safe …

Spremi Citiraj Spominje se 100 puta Srodni članci Svih 11 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Cautious reinforcement learning with logical constraints

M Hasanbeig, A Abate, D Kroening - arxiv preprint arxiv:2002.12156, 2020 - arxiv.org

This paper presents the concept of an adaptive safe padding that forces Reinforcement
Learning (RL) to synthesise optimal control policies while ensuring safety during the …

Spremi Citiraj Spominje se 107 puta Srodni članci Svih 8 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Safe reinforcement learning via probabilistic shields

N Jansen, B Könighofer, S Junges, AC Serban… - arxiv preprint arxiv …, 2018 - arxiv.org

This paper targets the efficient construction of a safety shield for decision making in
scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent …

Spremi Citiraj Spominje se 136 puta Srodni članci Svih 11 inačica Pretraživanje knjižnica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] tum.de

Safe reinforcement learning for autonomous lane changing using set-based prediction

H Krasowski, X Wang, M Althoff - 2020 IEEE 23rd international …, 2020 - ieeexplore.ieee.org

Machine learning approaches often lack safety guarantees, which are often a key
requirement in real-world tasks. This paper addresses the lack of safety guarantees by …

Spremi Citiraj Spominje se 94 puta Srodni članci Svih 6 inačica

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Enforcing policy feasibility constraints through differentiable projection for energy optimization

B Chen, PL Donti, K Baker, JZ Kolter… - Proceedings of the Twelfth …, 2021 - dl.acm.org

While reinforcement learning (RL) is gaining popularity in energy systems control, its real-
world applications are limited due to the fact that the actions from learned policies may not …

Spremi Citiraj Spominje se 64 puta Srodni članci Svih 6 inačica

Stvori obavijest

Citiraj

Napredno pretraživanje

Spremljeno u Moju knjižnicu

Verifiably safe off-model reinforcement learning

A review of safe reinforcement learning: Methods, theory and applications

How to certify machine learning based safety-critical systems? A systematic literature review

NNV: the neural network verification tool for deep neural networks and learning-enabled cyber-physical systems

Safe-state enhancement method for autonomous driving via direct hierarchical reinforcement learning

Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction guarantees

Neurosymbolic reinforcement learning with formally verified exploration

Cautious reinforcement learning with logical constraints

Safe reinforcement learning via probabilistic shields

Safe reinforcement learning for autonomous lane changing using set-based prediction

Enforcing policy feasibility constraints through differentiable projection for energy optimization