Академия Google

S Holt, A Hüyük… - Advances in Neural …, 2024 - proceedings.neurips.cc

The control of continuous-time environments while actively deciding when to take costly
observations in time is a crucial yet unexplored problem, particularly relevant to real-world …

Сохранить Цитировать Цитируется: 7 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]

[PDF] arxiv.org

An Idiosyncrasy of Time-discretization in Reinforcement Learning

K De Asis, RS Sutton - arxiv preprint arxiv:2406.14951, 2024 - arxiv.org

Many reinforcement learning algorithms are built on an assumption that an agent interacts
with an environment over fixed-duration, discrete time steps. However, physical systems are …

Сохранить Цитировать Цитируется: 1 Похожие статьи Все версии статьи (6) В виде HTML

[Free GPT-4]

[PDF] openreview.net

Mitigating the curse of horizon in Monte-Carlo returns

A Ayoub, D Szepesvari, F Zanini, B Chan… - Reinforcement …, 2024 - openreview.net

The standard framework in reinforcement learning (RL) dictates that an agent should use
every observation collected from interactions with the environment when updating its value …

Сохранить Цитировать Цитируется: 1 Похожие статьи Все версии статьи (2) В виде HTML

[Free GPT-4]

[PDF] unipd.it

Reinforcement Learning Through the Lens of Koopman Operators-The Infinite-Dimensional Framework

F Zanini - 2023 - research.unipd.it

Reinforcement Learning is nowadays one of the most active research areas in Artificial
Intelligence. This is due both to its practical successes, and to the theoretical challenges it …

Сохранить Цитировать Похожие статьи В виде HTML

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Managing temporal resolution in continuous value estimation: A fundamental trade-off

Active observing in continuous-time control

An Idiosyncrasy of Time-discretization in Reinforcement Learning

Mitigating the curse of horizon in Monte-Carlo returns

Reinforcement Learning Through the Lens of Koopman Operators-The Infinite-Dimensional Framework