Deep reinforcement learning for robotics: A survey of real-world successes

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Learning-based legged locomotion: State of the art and future perspectives

S Ha, J Lee, M van de Panne, Z **e… - … Journal of Robotics …, 2024 - journals.sagepub.com
Legged locomotion holds the premise of universal mobility, a critical capability for many real-
world robotic applications. Both model-based and learning-based approaches have …

Language to rewards for robotic skill synthesis

W Yu, N Gileadi, C Fu, S Kirmani, KH Lee… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated exciting progress in acquiring diverse
new capabilities through in-context learning, ranging from logical reasoning to code-writing …

Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control

Z Li, XB Peng, P Abbeel, S Levine… - … Journal of Robotics …, 2024 - journals.sagepub.com
This paper presents a comprehensive study on using deep reinforcement learning (RL) to
create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single …

Walk these ways: Tuning robot control for generalization with multiplicity of behavior

GB Margolis, P Agrawal - Conference on Robot Learning, 2023 - proceedings.mlr.press
Learned locomotion policies can rapidly adapt to diverse environments similar to those
experienced during training but lack a mechanism for fast tuning when they fail in an out-of …

Blind bipedal stair traversal via sim-to-real reinforcement learning

J Siekmann, K Green, J Warila, A Fern… - arxiv preprint arxiv …, 2021 - arxiv.org
Accurate and precise terrain estimation is a difficult problem for robot locomotion in real-
world environments. Thus, it is useful to have systems that do not depend on accurate …

Optimization-based control for dynamic legged robots

PM Wensing, M Posa, Y Hu, A Escande… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
In a world designed for legs, quadrupeds, bipeds, and humanoids have the opportunity to
impact emerging robotics applications from logistics, to agriculture, to home assistance. The …

Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion

G Ji, J Mun, H Kim, J Hwangbo - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
In this letter, we propose a locomotion training framework where a control policy and a state
estimator are trained concurrently. The framework consists of a policy network which outputs …

High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Y **, X Liu, Y Shao, H Wang, W Yang - Nature Machine Intelligence, 2022 - nature.com
Fast and stable locomotion of legged robots involves demanding and contradictory
requirements, in particular rapid control frequency as well as an accurate dynamics model …

Adversarial motion priors make good substitutes for complex reward functions

A Escontrela, XB Peng, W Yu, T Zhang… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
Training a high-dimensional simulated agent with an under-specified reward function often
leads the agent to learn physically infeasible strategies that are ineffective when deployed in …