Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Reward machines: Exploiting reward function structure in reinforcement learning
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …
such, these methods must extensively interact with the environment in order to discover …
On the expressivity of markov reward
Reward is the driving force for reinforcement-learning agents. This paper is dedicated to
understanding the expressivity of reward as a way to capture tasks that we would want an …
understanding the expressivity of reward as a way to capture tasks that we would want an …
Compositional reinforcement learning from logical specifications
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …
specifications. Recent approaches automatically generate a reward function from a given …
Automated verification and synthesis of stochastic hybrid systems: A survey
Stochastic hybrid systems have received significant attentions as a relevant modeling
framework describing many systems, from engineering to the life sciences: they enable the …
framework describing many systems, from engineering to the life sciences: they enable the …
(Ir) rationality in AI: State of the Art, Research Challenges and Open Questions
The concept of rationality is central to the field of artificial intelligence. Whether we are
seeking to simulate human reasoning, or the goal is to achieve bounded optimality, we …
seeking to simulate human reasoning, or the goal is to achieve bounded optimality, we …
Reinforcement learning with knowledge representation and reasoning: A brief survey
C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arxiv preprint arxiv:2304.12090, 2023 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …
still faces significant obstacles in addressing complex real-life problems due to the issues of …
Instructing goal-conditioned reinforcement learning agents with temporal logic objectives
W Qiu, W Mao, H Zhu - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …
On the limitations of markovian rewards to express multi-objective, risk-sensitive, and modal tasks
In this paper, we study the expressivity of scalar, Markovian reward functions in
Reinforcement Learning (RL), and identify several limitations to what they can express …
Reinforcement Learning (RL), and identify several limitations to what they can express …
Certified reinforcement learning with logic guidance
Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …
been applied to a variety of control problems. However, applications in safety-critical …
Preferential cyber defense for power grids
The integration of computing and communication capabilities into the power grid has led to
vulnerabilities enabling attackers to launch cyberattacks on the grid. The resources that can …
vulnerabilities enabling attackers to launch cyberattacks on the grid. The resources that can …