Deep reinforcement learning for robotics: A survey of real-world successes

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Machine learning for micro-and nanorobots

L Yang, J Jiang, F Ji, Y Li, KL Yung, A Ferreira… - Nature Machine …, 2024 - nature.com
Abstract Machine learning (ML) has revolutionized robotics by enhancing perception,
adaptability, decision-making and more, enabling robots to work in complex scenarios …

Learning robust perceptive locomotion for quadrupedal robots in the wild

T Miki, J Lee, J Hwangbo, L Wellhausen, V Koltun… - Science robotics, 2022 - science.org
Legged robots that can operate autonomously in remote and hazardous environments will
greatly increase opportunities for exploration into underexplored areas. Exteroceptive …

Anymal parkour: Learning agile navigation for quadrupedal robots

D Hoeller, N Rudin, D Sako, M Hutter - Science Robotics, 2024 - science.org
Performing agile navigation with four-legged robots is a challenging task because of the
highly dynamic motions, contacts with various parts of the robot, and the limited field of view …

Rewarded soups: towards pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

A Rame, G Couairon, C Dancette… - Advances in …, 2024 - proceedings.neurips.cc
Foundation models are first pre-trained on vast unsupervised datasets and then fine-tuned
on labeled data. Reinforcement learning, notably from human feedback (RLHF), can further …

Robot parkour learning

Z Zhuang, Z Fu, J Wang, C Atkeson… - arxiv preprint arxiv …, 2023 - arxiv.org
Parkour is a grand challenge for legged locomotion that requires robots to overcome various
obstacles rapidly in complex environments. Existing methods can generate either diverse …

Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion

G Ji, J Mun, H Kim, J Hwangbo - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
In this letter, we propose a locomotion training framework where a control policy and a state
estimator are trained concurrently. The framework consists of a policy network which outputs …

High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Y **, X Liu, Y Shao, H Wang, W Yang - Nature Machine Intelligence, 2022 - nature.com
Fast and stable locomotion of legged robots involves demanding and contradictory
requirements, in particular rapid control frequency as well as an accurate dynamics model …

Animal robots in the African wilderness: Lessons learned and outlook for field robotics

K Melo, T Horvat, AJ Ijspeert - Science Robotics, 2023 - science.org
In early 2016, we had the opportunity to test a pair of sprawling posture robots, one
designed to mimic a crocodile and another designed to mimic a monitor lizard, along the …

Genloco: Generalized locomotion controllers for quadrupedal robots

G Feng, H Zhang, Z Li, XB Peng… - … on Robot Learning, 2023 - proceedings.mlr.press
Recent years have seen a surge in commercially-available and affordable quadrupedal
robots, with many of these platforms being actively used in research and industry. As the …