Google Scholar

Cited by: 2178

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arXiv preprint arXiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Cited by: 360

Is conditional generative modeling all you need for decision-making?

A Ajay, Y Du, A Gupta, J Tenenbaum… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent improvements in conditional generative modeling have made it possible to generate
high-quality images from language descriptions alone. We investigate whether these …

Cited by: 866

Offline reinforcement learning with implicit Q-learning

I Kostrikov, A Nair, S Levine - arXiv preprint arXiv:2110.06169, 2021 - arxiv.org

Offline reinforcement learning requires reconciling two conflicting aims: learning a policy that
improves over the behavior policy that collected the dataset, while at the same time …

Cited by: 85


Q-Transformer: Scalable offline reinforcement learning via autoregressive Q-functions

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023 - proceedings.mlr.press

In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …

Cited by: 322

Diffusion policies as an expressive policy class for offline reinforcement learning

Z Wang, JJ Hunt, M Zhou - arXiv preprint arXiv:2208.06193, 2022 - arxiv.org

Offline reinforcement learning (RL), which aims to learn an optimal policy using a previously
collected static dataset, is an important paradigm of RL. Standard RL methods often perform …

Cited by: 1799

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Cited by: 869

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

Cited by: 793

Offline reinforcement learning as one big sequence modeling problem

M Janner, Q Li, S Levine - Advances in neural information …, 2021 - proceedings.neurips.cc

Reinforcement learning (RL) is typically viewed as the problem of estimating single-step
policies (for model-free RL) or single-step models (for model-based RL), leveraging the …