Google Akademik

K Murphy - arxiv preprint arxiv:2412.05265, 2024 - arxiv.org

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement
learning and sequential decision making, covering value-based RL, policy-gradient …

Kaydet Alıntı yap Alıntılanma sayısı: 1 İlgili makaleler HTML olarak görüntüle

[Free GPT-4]

[PDF] arxiv.org

Deep laplacian-based options for temporally-extended exploration

M Klissarov, MC Machado - arxiv preprint arxiv:2301.11181, 2023 - arxiv.org

Selecting exploratory actions that generate a rich stream of experience for better learning is
a fundamental challenge in reinforcement learning (RL). An approach to tackle this problem …

Kaydet Alıntı yap Alıntılanma sayısı: 20 İlgili makaleler 6 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] mlr.press

Timing as an Action: Learning When to Observe and Act

H Zhou, A Huang, K Azizzadenesheli… - International …, 2024 - proceedings.mlr.press

In standard reinforcement learning setups, the agent receives observations and performs
actions at evenly spaced intervals. However, in many real-world settings, observations are …

Kaydet Alıntı yap Alıntılanma sayısı: 1 İlgili makaleler HTML olarak görüntüle

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Reward-respecting subtasks for model-based reinforcement learning

RS Sutton, MC Machado, GZ Holland, D Szepesvari… - Artificial Intelligence, 2023 - Elsevier

To achieve the ambitious goals of artificial intelligence, reinforcement learning must include
planning with a model of the world that is abstract in state and time. Deep learning has made …

Kaydet Alıntı yap Alıntılanma sayısı: 25 İlgili makaleler 9 sürümün hepsi

[Free GPT-4]

[PDF] arxiv.org

Reasoning with latent diffusion in offline reinforcement learning

S Venkatraman, S Khaitan, RT Akella, J Dolan… - arxiv preprint arxiv …, 2023 - arxiv.org

Offline reinforcement learning (RL) holds promise as a means to learn high-reward policies
from a static dataset, without the need for further environment interactions. However, a key …

Kaydet Alıntı yap Alıntılanma sayısı: 18 İlgili makaleler 3 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] arxiv.org

Artificial general intelligence (AGI)-native wireless systems: A journey beyond 6G

W Saad, O Hashash, CK Thomas, C Chaccour… - arxiv preprint arxiv …, 2024 - arxiv.org

Building future wireless systems that support services like digital twins (DTs) is challenging
to achieve through advances to conventional technologies like meta-surfaces. While artificial …

Kaydet Alıntı yap Alıntılanma sayısı: 17 İlgili makaleler 2 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] neurips.cc

Multi-step generalized policy improvement by leveraging approximate models

LN Alegre, A Bazzan, A Nowé… - Advances in Neural …, 2024 - proceedings.neurips.cc

We introduce a principled method for performing zero-shot transfer in reinforcement learning
(RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has …

Kaydet Alıntı yap Alıntılanma sayısı: 3 İlgili makaleler 5 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] neurips.cc

Provably (more) sample-efficient offline RL with options

X Hu, H Leung - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc

The options framework yields empirical success in long-horizon planning problems of
reinforcement learning (RL). Recent works show that options help improve the sample …

Kaydet Alıntı yap İlgili makaleler 4 sürümün hepsi HTML olarak görüntüle

[Free GPT-4]

[PDF] ssrn.com

Scenario-level knowledge transfer for motion planning of autonomous driving via successor representation

H Lu, C Lu, H Wang, J Gong, M Zhu, H Yang - Transportation Research Part …, 2024 - Elsevier

For autonomous vehicles, transfer learning can enhance performance by making better use
of previously learned knowledge in newly encountered scenarios, which holds great …

Kaydet Alıntı yap Alıntılanma sayısı: 1 İlgili makaleler

[Free GPT-4]

[PDF] arxiv.org

When does self-prediction help? understanding auxiliary tasks in reinforcement learning

C Voelcker, T Kastner, I Gilitschenski… - arxiv preprint arxiv …, 2024 - arxiv.org

We investigate the impact of auxiliary learning tasks such as observation reconstruction and
latent self-prediction on the representation learning problem in reinforcement learning. We …

Kaydet Alıntı yap Alıntılanma sayısı: 1 İlgili makaleler 7 sürümün hepsi HTML olarak görüntüle

Uyarı oluştur

Alıntı yap

Gelişmiş arama

Kitaplığım'a kaydedildi

Temporal abstraction in reinforcement learning with the successor representation

Reinforcement learning: An overview

Deep laplacian-based options for temporally-extended exploration

Timing as an Action: Learning When to Observe and Act

[HTML][HTML] Reward-respecting subtasks for model-based reinforcement learning

Reasoning with latent diffusion in offline reinforcement learning

Artificial general intelligence (AGI)-native wireless systems: A journey beyond 6G

Multi-step generalized policy improvement by leveraging approximate models

Provably (more) sample-efficient offline RL with options

Scenario-level knowledge transfer for motion planning of autonomous driving via successor representation

When does self-prediction help? understanding auxiliary tasks in reinforcement learning