Google Acadèmic

RF Prudencio, MROA Maximo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

With the widespread adoption of deep learning, reinforcement learning (RL) has
experienced a dramatic increase in popularity, scaling to previously intractable problems …

Desa Cita Citat per 358 Articles relacionats Totes les 8 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

S Levine, A Kumar, G Tucker, J Fu - arxiv preprint arxiv:2005.01643, 2020 - arxiv.org

In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …

Desa Cita Citat per 2218 Articles relacionats Totes les 3 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Decision transformer: Reinforcement learning via sequence modeling

L Chen, K Lu, A Rajeswaran, K Lee… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce a framework that abstracts Reinforcement Learning (RL) as a sequence
modeling problem. This allows us to draw upon the simplicity and scalability of the …

Desa Cita Citat per 1827 Articles relacionats Totes les 13 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press

In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

Desa Cita Citat per 97 Articles relacionats Totes les 6 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Offline reinforcement learning as one big sequence modeling problem

M Janner, Q Li, S Levine - Advances in neural information …, 2021 - proceedings.neurips.cc

Reinforcement learning (RL) is typically viewed as the problem of estimating single-step
policies (for model-free RL) or single-step models (for model-based RL), leveraging the …

Desa Cita Citat per 812 Articles relacionats Totes les 9 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Causal machine learning: A survey and open problems

J Kaddour, A Lynch, Q Liu, MJ Kusner… - arxiv preprint arxiv …, 2022 - arxiv.org

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods
that formalize the data-generation process as a structural causal model (SCM). This …

Desa Cita Citat per 186 Articles relacionats Totes les 2 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Generative skill chaining: Long-horizon skill planning with diffusion models

UA Mishra, S Xue, Y Chen… - Conference on Robot …, 2023 - proceedings.mlr.press

Long-horizon tasks, usually characterized by complex subtask dependencies, present a
significant challenge in manipulation planning. Skill chaining is a practical approach to …

Desa Cita Citat per 62 Articles relacionats Totes les 6 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Masked world models for visual control

Y Seo, D Hafner, H Liu, F Liu, S James… - … on Robot Learning, 2023 - proceedings.mlr.press

Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient
robot learning from visual observations. Yet the current approaches typically train a single …

Desa Cita Citat per 142 Articles relacionats Totes les 6 versions Free GPT-4 DeepSeek Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Temporal difference learning for model predictive control

N Hansen, X Wang, H Su - arxiv preprint arxiv:2203.04955, 2022 - arxiv.org

Data-driven model predictive control has two key advantages over model-free methods: a
potential for improved sample efficiency through model learning, and better performance as …

Desa Cita Citat per 218 Articles relacionats Totes les 8 versions Free GPT-4 DeepSeek Cerca de biblioteques Versió HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Synthetic experience replay

C Lu, P Ball, YW Teh… - Advances in Neural …, 2023 - proceedings.neurips.cc

A key theme in the past decade has been that when large neural networks and large
datasets combine they can produce remarkable results. In deep reinforcement learning (RL) …

Desa Cita Citat per 73 Articles relacionats Totes les 8 versions Free GPT-4 DeepSeek Versió HTML

Crea una alerta

Cita

Cerca avançada

S'ha desat a La meva biblioteca

When to trust your model: Model-based policy optimization

A survey on offline reinforcement learning: Taxonomy, review, and open problems

Offline reinforcement learning: Tutorial, review, and perspectives on open problems

Decision transformer: Reinforcement learning via sequence modeling

The dormant neuron phenomenon in deep reinforcement learning

Offline reinforcement learning as one big sequence modeling problem

Causal machine learning: A survey and open problems

Generative skill chaining: Long-horizon skill planning with diffusion models

Masked world models for visual control

Temporal difference learning for model predictive control

Synthetic experience replay