Академия Google

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

Сохранить Цитировать Цитируется: 260 Похожие статьи Все версии статьи (11) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

On the expressivity of markov reward

D Abel, W Dabney, A Harutyunyan… - Advances in …, 2021 - proceedings.neurips.cc

Reward is the driving force for reinforcement-learning agents. This paper is dedicated to
understanding the expressivity of reward as a way to capture tasks that we would want an …

Сохранить Цитировать Цитируется: 109 Похожие статьи Все версии статьи (15) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Сохранить Цитировать Цитируется: 99 Похожие статьи Все версии статьи (16) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] sciencedirect.com

Automated verification and synthesis of stochastic hybrid systems: A survey

A Lavaei, S Soudjani, A Abate, M Zamani - Automatica, 2022 - Elsevier

Stochastic hybrid systems have received significant attentions as a relevant modeling
framework describing many systems, from engineering to the life sciences: they enable the …

Сохранить Цитировать Цитируется: 119 Похожие статьи Все версии статьи (12)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

(Ir) rationality in AI: State of the Art, Research Challenges and Open Questions

O Macmillan-Scott, M Musolesi - arxiv preprint arxiv:2311.17165, 2023 - arxiv.org

The concept of rationality is central to the field of artificial intelligence. Whether we are
seeking to simulate human reasoning, or the goal is to achieve bounded optimality, we …

Сохранить Цитировать Цитируется: 6 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arxiv preprint arxiv:2304.12090, 2023 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …

Сохранить Цитировать Цитируется: 9 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Instructing goal-conditioned reinforcement learning agents with temporal logic objectives

W Qiu, W Mao, H Zhu - Advances in Neural Information …, 2023 - proceedings.neurips.cc

Goal-conditioned reinforcement learning (RL) is a powerful approach for learning general-
purpose skills by reaching diverse goals. However, it has limitations when it comes to task …

Сохранить Цитировать Цитируется: 9 Похожие статьи Все версии статьи (6) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

On the limitations of markovian rewards to express multi-objective, risk-sensitive, and modal tasks

J Skalse, A Abate - Uncertainty in Artificial Intelligence, 2023 - proceedings.mlr.press

In this paper, we study the expressivity of scalar, Markovian reward functions in
Reinforcement Learning (RL), and identify several limitations to what they can express …

Сохранить Цитировать Цитируется: 13 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Certified reinforcement learning with logic guidance

H Hasanbeig, D Kroening, A Abate - Artificial Intelligence, 2023 - Elsevier

Reinforcement Learning (RL) is a widely employed machine learning architecture that has
been applied to a variety of control problems. However, applications in safety-critical …

Сохранить Цитировать Цитируется: 69 Похожие статьи Все версии статьи (12)

[Free GPT-4]
[DeepSeek]

[PDF] aps.org

Preferential cyber defense for power grids

M Moradi, Y Weng, J Dirkman, YC Lai - PRX Energy, 2023 - APS

The integration of computing and communication capabilities into the power grid has led to
vulnerabilities enabling attackers to launch cyberattacks on the grid. The resources that can …

Сохранить Цитировать Цитируется: 11 Похожие статьи Все версии статьи (8)

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Multi-agent reinforcement learning with temporal logic specifications

Reward machines: Exploiting reward function structure in reinforcement learning

On the expressivity of markov reward

Compositional reinforcement learning from logical specifications

Automated verification and synthesis of stochastic hybrid systems: A survey

(Ir) rationality in AI: State of the Art, Research Challenges and Open Questions

Reinforcement learning with knowledge representation and reasoning: A brief survey

Instructing goal-conditioned reinforcement learning agents with temporal logic objectives

On the limitations of markovian rewards to express multi-objective, risk-sensitive, and modal tasks

Certified reinforcement learning with logic guidance

Preferential cyber defense for power grids