- Academic Search

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org

Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

Salva Cita Citato da 24 Articoli correlati Tutte e 3 le versioni

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning-based legged locomotion: State of the art and future perspectives

S Ha, J Lee, M van de Panne, Z **e… - … Journal of Robotics …, 2024 - journals.sagepub.com

Legged locomotion holds the premise of universal mobility, a critical capability for many real-
world robotic applications. Both model-based and learning-based approaches have …

Salva Cita Citato da 10 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Language to rewards for robotic skill synthesis

W Yu, N Gileadi, C Fu, S Kirmani, KH Lee… - arxiv preprint arxiv …, 2023 - arxiv.org

Large language models (LLMs) have demonstrated exciting progress in acquiring diverse
new capabilities through in-context learning, ranging from logical reasoning to code-writing …

Salva Cita Citato da 262 Articoli correlati Tutte e 4 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control

Z Li, XB Peng, P Abbeel, S Levine… - … Journal of Robotics …, 2024 - journals.sagepub.com

This paper presents a comprehensive study on using deep reinforcement learning (RL) to
create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single …

Salva Cita Citato da 46 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Walk these ways: Tuning robot control for generalization with multiplicity of behavior

GB Margolis, P Agrawal - Conference on Robot Learning, 2023 - proceedings.mlr.press

Learned locomotion policies can rapidly adapt to diverse environments similar to those
experienced during training but lack a mechanism for fast tuning when they fail in an out-of …

Salva Cita Citato da 143 Articoli correlati Tutte e 4 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Blind bipedal stair traversal via sim-to-real reinforcement learning

J Siekmann, K Green, J Warila, A Fern… - arxiv preprint arxiv …, 2021 - arxiv.org

Accurate and precise terrain estimation is a difficult problem for robot locomotion in real-
world environments. Thus, it is useful to have systems that do not depend on accurate …

Salva Cita Citato da 212 Articoli correlati Tutte e 8 le versioni Versione HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Optimization-based control for dynamic legged robots

PM Wensing, M Posa, Y Hu, A Escande… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

In a world designed for legs, quadrupeds, bipeds, and humanoids have the opportunity to
impact emerging robotics applications from logistics, to agriculture, to home assistance. The …

Salva Cita Citato da 129 Articoli correlati Tutte e 9 le versioni

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion

G Ji, J Mun, H Kim, J Hwangbo - IEEE Robotics and …, 2022 - ieeexplore.ieee.org

In this letter, we propose a locomotion training framework where a control policy and a state
estimator are trained concurrently. The framework consists of a policy network which outputs …

Salva Cita Citato da 156 Articoli correlati Tutte e 4 le versioni

High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Y **, X Liu, Y Shao, H Wang, W Yang - Nature Machine Intelligence, 2022 - nature.com

Fast and stable locomotion of legged robots involves demanding and contradictory
requirements, in particular rapid control frequency as well as an accurate dynamics model …

Salva Cita Citato da 48 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Adversarial motion priors make good substitutes for complex reward functions

A Escontrela, XB Peng, W Yu, T Zhang… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org

Training a high-dimensional simulated agent with an under-specified reward function often
leads the agent to learn physically infeasible strategies that are ineffective when deployed in …

Salva Cita Citato da 105 Articoli correlati Tutte e 5 le versioni

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Sim-to-real learning of all common bipedal gaits via periodic reward composition

Deep reinforcement learning for robotics: A survey of real-world successes

Learning-based legged locomotion: State of the art and future perspectives

Language to rewards for robotic skill synthesis

Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control

Walk these ways: Tuning robot control for generalization with multiplicity of behavior

Blind bipedal stair traversal via sim-to-real reinforcement learning

Optimization-based control for dynamic legged robots

Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion

High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Adversarial motion priors make good substitutes for complex reward functions