- Academic Search

O Lockwood, M Si - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

Uncertainty is ubiquitous in games, both in the agents playing games and often in the games
themselves. Working with uncertainty is therefore an important component of successful …

Speichern Zitieren Zitiert von: 61 Ähnliche Artikel Alle 9 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Learning-based legged locomotion: State of the art and future perspectives

S Ha, J Lee, M van de Panne, Z **e… - … Journal of Robotics …, 2024 - journals.sagepub.com

Legged locomotion holds the premise of universal mobility, a critical capability for many real-
world robotic applications. Both model-based and learning-based approaches have …

Speichern Zitieren Zitiert von: 10 Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]

[PDF] mlr.press

Efficient online reinforcement learning with offline data

PJ Ball, L Smith, I Kostrikov… - … Conference on Machine …, 2023 - proceedings.mlr.press

Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …

Speichern Zitieren Zitiert von: 143 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] mlr.press

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press

In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

Speichern Zitieren Zitiert von: 89 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

A walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning

L Smith, I Kostrikov, S Levine - arxiv preprint arxiv:2208.07860, 2022 - arxiv.org

Deep reinforcement learning is a promising approach to learning policies in uncontrolled
environments that do not require domain knowledge. Unfortunately, due to sample …

Speichern Zitieren Zitiert von: 107 Ähnliche Artikel Alle 2 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Plastic: Improving input and label plasticity for sample efficient reinforcement learning

H Lee, H Cho, H Kim, D Gwak, J Kim… - Advances in …, 2024 - proceedings.neurips.cc

Abstract In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly
in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms …

Speichern Zitieren Zitiert von: 24 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Learning and adapting agile locomotion skills by transferring experience

L Smith, JC Kew, T Li, L Luu, XB Peng, S Ha… - arxiv preprint arxiv …, 2023 - arxiv.org

Legged robots have enormous potential in their range of capabilities, from navigating
unstructured terrains to high-speed running. However, designing robust controllers for highly …

Speichern Zitieren Zitiert von: 56 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] jiangteam.org

[PDF][PDF] A survey on uncertainty quantification methods for deep learning

W He, Z Jiang, T **ao, Z Xu, Y Li - arxiv preprint arxiv:2302.13425, 2023 - jiangteam.org

A Survey on Uncertainty Quantification Methods for Deep Neural Networks: An Uncertainty
Source's Perspective Page 1 A Survey on Uncertainty Quantification Methods for Deep Neural …

Speichern Zitieren Zitiert von: 47 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arxiv preprint arxiv:2310.08887, 2023 - arxiv.org

Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

Speichern Zitieren Zitiert von: 30 Ähnliche Artikel Alle 4 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Revisiting the minimalist approach to offline reinforcement learning

D Tarasov, V Kurenkov, A Nikulin… - Advances in Neural …, 2024 - proceedings.neurips.cc

Recent years have witnessed significant advancements in offline reinforcement learning
(RL), resulting in the development of numerous algorithms with varying degrees of …

Speichern Zitieren Zitiert von: 28 Ähnliche Artikel Alle 6 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Dropout q-functions for doubly efficient reinforcement learning

A review of uncertainty for deep reinforcement learning

Learning-based legged locomotion: State of the art and future perspectives

Efficient online reinforcement learning with offline data

The dormant neuron phenomenon in deep reinforcement learning

A walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning

Plastic: Improving input and label plasticity for sample efficient reinforcement learning

Learning and adapting agile locomotion skills by transferring experience

[PDF][PDF] A survey on uncertainty quantification methods for deep learning

Metra: Scalable unsupervised rl with metric-aware abstraction

Revisiting the minimalist approach to offline reinforcement learning