Deep reinforcement learning for robotics: A survey of real-world successes
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
Learning-based legged locomotion: State of the art and future perspectives
Legged locomotion holds the premise of universal mobility, a critical capability for many real-
world robotic applications. Both model-based and learning-based approaches have …
world robotic applications. Both model-based and learning-based approaches have …
Language to rewards for robotic skill synthesis
Large language models (LLMs) have demonstrated exciting progress in acquiring diverse
new capabilities through in-context learning, ranging from logical reasoning to code-writing …
new capabilities through in-context learning, ranging from logical reasoning to code-writing …
Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control
This paper presents a comprehensive study on using deep reinforcement learning (RL) to
create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single …
create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single …
Walk these ways: Tuning robot control for generalization with multiplicity of behavior
Learned locomotion policies can rapidly adapt to diverse environments similar to those
experienced during training but lack a mechanism for fast tuning when they fail in an out-of …
experienced during training but lack a mechanism for fast tuning when they fail in an out-of …
Blind bipedal stair traversal via sim-to-real reinforcement learning
Accurate and precise terrain estimation is a difficult problem for robot locomotion in real-
world environments. Thus, it is useful to have systems that do not depend on accurate …
world environments. Thus, it is useful to have systems that do not depend on accurate …
Optimization-based control for dynamic legged robots
In a world designed for legs, quadrupeds, bipeds, and humanoids have the opportunity to
impact emerging robotics applications from logistics, to agriculture, to home assistance. The …
impact emerging robotics applications from logistics, to agriculture, to home assistance. The …
Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion
In this letter, we propose a locomotion training framework where a control policy and a state
estimator are trained concurrently. The framework consists of a policy network which outputs …
estimator are trained concurrently. The framework consists of a policy network which outputs …
High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning
Y **, X Liu, Y Shao, H Wang, W Yang - Nature Machine Intelligence, 2022 - nature.com
Fast and stable locomotion of legged robots involves demanding and contradictory
requirements, in particular rapid control frequency as well as an accurate dynamics model …
requirements, in particular rapid control frequency as well as an accurate dynamics model …
Adversarial motion priors make good substitutes for complex reward functions
Training a high-dimensional simulated agent with an under-specified reward function often
leads the agent to learn physically infeasible strategies that are ineffective when deployed in …
leads the agent to learn physically infeasible strategies that are ineffective when deployed in …