A review of uncertainty for deep reinforcement learning

O Lockwood, M Si - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Uncertainty is ubiquitous in games, both in the agents playing games and often in the games
themselves. Working with uncertainty is therefore an important component of successful …

Learning-based legged locomotion: State of the art and future perspectives

S Ha, J Lee, M van de Panne, Z **e… - … Journal of Robotics …, 2024 - journals.sagepub.com
Legged locomotion holds the premise of universal mobility, a critical capability for many real-
world robotic applications. Both model-based and learning-based approaches have …

Efficient online reinforcement learning with offline data

PJ Ball, L Smith, I Kostrikov… - … Conference on Machine …, 2023 - proceedings.mlr.press
Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …

The dormant neuron phenomenon in deep reinforcement learning

G Sokar, R Agarwal, PS Castro… - … Conference on Machine …, 2023 - proceedings.mlr.press
In this work we identify the dormant neuron phenomenon in deep reinforcement learning,
where an agent's network suffers from an increasing number of inactive neurons, thereby …

A walk in the park: Learning to walk in 20 minutes with model-free reinforcement learning

L Smith, I Kostrikov, S Levine - arxiv preprint arxiv:2208.07860, 2022 - arxiv.org
Deep reinforcement learning is a promising approach to learning policies in uncontrolled
environments that do not require domain knowledge. Unfortunately, due to sample …

Plastic: Improving input and label plasticity for sample efficient reinforcement learning

H Lee, H Cho, H Kim, D Gwak, J Kim… - Advances in …, 2024 - proceedings.neurips.cc
Abstract In Reinforcement Learning (RL), enhancing sample efficiency is crucial, particularly
in scenarios when data acquisition is costly and risky. In principle, off-policy RL algorithms …

Learning and adapting agile locomotion skills by transferring experience

L Smith, JC Kew, T Li, L Luu, XB Peng, S Ha… - arxiv preprint arxiv …, 2023 - arxiv.org
Legged robots have enormous potential in their range of capabilities, from navigating
unstructured terrains to high-speed running. However, designing robust controllers for highly …

[PDF][PDF] A survey on uncertainty quantification methods for deep learning

W He, Z Jiang, T **ao, Z Xu, Y Li - arxiv preprint arxiv:2302.13425, 2023 - jiangteam.org
A Survey on Uncertainty Quantification Methods for Deep Neural Networks: An Uncertainty
Source's Perspective Page 1 A Survey on Uncertainty Quantification Methods for Deep Neural …

Metra: Scalable unsupervised rl with metric-aware abstraction

S Park, O Rybkin, S Levine - arxiv preprint arxiv:2310.08887, 2023 - arxiv.org
Unsupervised pre-training strategies have proven to be highly effective in natural language
processing and computer vision. Likewise, unsupervised reinforcement learning (RL) holds …

Revisiting the minimalist approach to offline reinforcement learning

D Tarasov, V Kurenkov, A Nikulin… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent years have witnessed significant advancements in offline reinforcement learning
(RL), resulting in the development of numerous algorithms with varying degrees of …