Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A review of safe reinforcement learning: Methods, theory and applications
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …
making tasks. However, safety concerns are raised during deploying RL in real-world …
Advances in trajectory optimization for space vehicle control
Abstract Space mission design places a premium on cost and operational efficiency. The
search for new science and life beyond Earth calls for spacecraft that can deliver scientific …
search for new science and life beyond Earth calls for spacecraft that can deliver scientific …
Autonomous drone racing: A survey
Over the last decade, the use of autonomous drone systems for surveying, search and
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …
rescue, or last-mile delivery has increased exponentially. With the rise of these applications …
A lyapunov-based approach to safe reinforcement learning
In many real-world reinforcement learning (RL) problems, besides optimizing the main
objective function, an agent must concurrently avoid violating a number of constraints. In …
objective function, an agent must concurrently avoid violating a number of constraints. In …
Batch policy learning under constraints
When learning policies for real-world domains, two important questions arise:(i) how to
efficiently use pre-collected off-policy, non-optimal behavior data; and (ii) how to mediate …
efficiently use pre-collected off-policy, non-optimal behavior data; and (ii) how to mediate …
Natural policy gradient primal-dual method for constrained markov decision processes
We study sequential decision-making problems in which each agent aims to maximize the
expected total reward while satisfying a constraint on the expected total utility. We employ …
expected total reward while satisfying a constraint on the expected total utility. We employ …
Risk-constrained reinforcement learning with percentile risk criteria
In many sequential decision-making problems one is interested in minimizing an expected
cumulative cost while taking into account risk, ie, increased awareness of events of small …
cumulative cost while taking into account risk, ie, increased awareness of events of small …
Step: Stochastic traversability evaluation and planning for risk-aware off-road navigation
Although ground robotic autonomy has gained widespread usage in structured and
controlled environments, autonomy in unknown and off-road terrain remains a difficult …
controlled environments, autonomy in unknown and off-road terrain remains a difficult …
How should a robot assess risk? towards an axiomatic theory of risk in robotics
Endowing robots with the capability of assessing risk and making risk-aware decisions is
widely considered a key step toward ensuring safety for robots operating under uncertainty …
widely considered a key step toward ensuring safety for robots operating under uncertainty …
Safe exploration and optimization of constrained mdps using gaussian processes
We present a reinforcement learning approach to explore and optimize a safety-constrained
Markov Decision Process (MDP). In this setting, the agent must maximize discounted …
Markov Decision Process (MDP). In this setting, the agent must maximize discounted …