A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arxiv preprint arxiv …, 2022 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Advances in trajectory optimization for space vehicle control

D Malyuta, Y Yu, P Elango, B Açıkmeşe - Annual Reviews in Control, 2021 - Elsevier
Abstract Space mission design places a premium on cost and operational efficiency. The
search for new science and life beyond Earth calls for spacecraft that can deliver scientific …

Autonomous drone racing: A survey

D Hanover, A Loquercio, L Bauersfeld… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Over the last decade, the use of autonomous drone systems for surveying, search and
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …

A lyapunov-based approach to safe reinforcement learning

Y Chow, O Nachum… - Advances in neural …, 2018 - proceedings.neurips.cc
In many real-world reinforcement learning (RL) problems, besides optimizing the main
objective function, an agent must concurrently avoid violating a number of constraints. In …

Batch policy learning under constraints

H Le, C Voloshin, Y Yue - International Conference on …, 2019 - proceedings.mlr.press
When learning policies for real-world domains, two important questions arise:(i) how to
efficiently use pre-collected off-policy, non-optimal behavior data; and (ii) how to mediate …

Natural policy gradient primal-dual method for constrained markov decision processes

D Ding, K Zhang, T Basar… - Advances in Neural …, 2020 - proceedings.neurips.cc
We study sequential decision-making problems in which each agent aims to maximize the
expected total reward while satisfying a constraint on the expected total utility. We employ …

Risk-constrained reinforcement learning with percentile risk criteria

Y Chow, M Ghavamzadeh, L Janson… - Journal of Machine …, 2018 - jmlr.org
In many sequential decision-making problems one is interested in minimizing an expected
cumulative cost while taking into account risk, ie, increased awareness of events of small …

Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation

DD Fan, K Otsu, Y Kubo, A Dixit, J Burdick… - arxiv preprint arxiv …, 2021 - arxiv.org
Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …

How should a robot assess risk? towards an axiomatic theory of risk in robotics

A Majumdar, M Pavone - Robotics Research: The 18th International …, 2020 - Springer
Endowing robots with the capability of assessing risk and making risk-aware decisions is
widely considered a key step toward ensuring safety for robots operating under uncertainty …

Safe exploration and optimization of constrained mdps using gaussian processes

A Wachi, Y Sui, Y Yue, M Ono - Proceedings of the AAAI Conference on …, 2018 - ojs.aaai.org
We present a reinforcement learning approach to explore and optimize a safety-constrained
Markov Decision Process (MDP). In this setting, the agent must maximize discounted …