- Academic Search

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

保存引用被引用数: 350 関連記事全 9 バージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

保存引用被引用数: 110 関連記事全 4 バージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

A minimalist approach to offline reinforcement learning

S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …

保存引用被引用数: 861 関連記事全 6 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Mildly conservative q-learning for offline reinforcement learning

J Lyu, X Ma, X Li, Z Lu - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Offline reinforcement learning (RL) defines the task of learning from a static logged dataset
without continually interacting with the environment. The distribution shift between the …

保存引用被引用数: 126 関連記事全 5 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Critic regularized regression

Z Wang, A Novikov, K Zolna, JS Merel… - Advances in …, 2020 - proceedings.neurips.cc

Offline reinforcement learning (RL), also known as batch RL, offers the prospect of policy
optimization from large pre-recorded datasets without online environment interaction. It …

保存引用被引用数: 350 関連記事全 9 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Offline rl without off-policy evaluation

D Brandfonbrener, W Whitney… - Advances in neural …, 2021 - proceedings.neurips.cc

Most prior approaches to offline reinforcement learning (RL) have taken an iterative actor-
critic approach involving off-policy evaluation. In this paper we show that simply doing one …

保存引用被引用数: 184 関連記事全 10 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Challenges of real-world reinforcement learning: definitions, benchmarks and analysis

G Dulac-Arnold, N Levine, DJ Mankowitz, J Li… - Machine Learning, 2021 - Springer

Reinforcement learning (RL) has proven its worth in a series of artificial domains, and is
beginning to show some successes in real-world scenarios. However, much of the research …

保存引用被引用数: 578 関連記事全 6 バージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Autonomous evaluation and refinement of digital agents

J Pan, Y Zhang, N Tomlin, Y Zhou, S Levine… - arxiv preprint arxiv …, 2024 - arxiv.org

We show that domain-general automatic evaluators can significantly improve the
performance of agents for web navigation and device control. We experiment with multiple …

保存引用被引用数: 37 関連記事全 2 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Offline reinforcement learning via high-fidelity generative behavior modeling

H Chen, C Lu, C Ying, H Su, J Zhu - arxiv preprint arxiv:2209.14548, 2022 - arxiv.org

In offline reinforcement learning, weighted regression is a common method to ensure the
learned policy stays close to the behavior policy and to prevent selecting out-of-sample …

保存引用被引用数: 95 関連記事全 3 バージョン HTMLバージョン

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Q-learning decision transformer: Leveraging dynamic programming for conditional sequence modelling in offline rl

T Yamagata, A Khalil… - … on Machine Learning, 2023 - proceedings.mlr.press

Recent works have shown that tackling offline reinforcement learning (RL) with a conditional
policy produces promising results. The Decision Transformer (DT) combines the conditional …

保存引用被引用数: 80 関連記事全 9 バージョン HTMLバージョン

アラートを作成

引用

検索オプション

マイライブラリに保存しました

Bail: Best-action imitation learning for batch deep reinforcement learning

A survey on offline reinforcement learning: Taxonomy, review, and open problems

A survey on model-based reinforcement learning

A minimalist approach to offline reinforcement learning

Mildly conservative q-learning for offline reinforcement learning

Critic regularized regression

Offline rl without off-policy evaluation

Challenges of real-world reinforcement learning: definitions, benchmarks and analysis

Autonomous evaluation and refinement of digital agents

Offline reinforcement learning via high-fidelity generative behavior modeling

Q-learning decision transformer: Leveraging dynamic programming for conditional sequence modelling in offline rl