Policy search in continuous action domains: an overview

O Sigaud, F Stulp - Neural Networks, 2019 - Elsevier
Continuous action policy search is currently the focus of intensive research, driven both by
the recent success of deep reinforcement learning algorithms and the emergence of …

Practice makes perfect: An optimization-based approach to controlling agile motions for a quadruped robot

C Gehring, S Coros, M Hutter… - IEEE Robotics & …, 2016 - ieeexplore.ieee.org
This article approaches the problem of controlling quadrupedal running and jum**
motions with a parameterized, model-based, state-feedback controller. Inspired by the motor …

[PDF][PDF] Learning control

S Calinon, D Lee - Humanoid robotics: A reference, 2017 - mediatum.ub.tum.de
This chapter presents an overview of learning approaches for the acquisition of controllers
and movement skills in humanoid robots. The term learning control refers to the process of …

Ridm: Reinforced inverse dynamics modeling for learning from a single observed demonstration

BS Pavse, F Torabi, J Hanna… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Augmenting reinforcement learning with imitation learning is often hailed as a method by
which to improve upon learning from scratch. However, most existing methods for integrating …

A robust stability approach to robot reinforcement learning based on a parameterization of stabilizing controllers

SR Friedrich, M Buss - 2017 IEEE International Conference on …, 2017 - ieeexplore.ieee.org
Reinforcement learning has become more and more popular in robotics for acquiring
feedback controllers. Many approaches aim for learning a controller from scratch, ie, data …

[HTML][HTML] Mirror descent search and its acceleration

M Miyashita, S Yano, T Kondo - Robotics and Autonomous Systems, 2018 - Elsevier
In recent years, attention has been focused on the relationship between black-box
optimization problem and reinforcement learning problem. In this research, we propose the …

Direct state-to-action map** for high DOF robots using ELM

J Hwangbo, C Gehring, D Bellicoso… - 2015 IEEE/RSJ …, 2015 - ieeexplore.ieee.org
Methods of optimizing a single trajectory are mature enough for planning in many
applications. Yet such optimization methods applied to high Degree-Of-Freedom robots …

Directed exploration in black-box optimization for multi-objective reinforcement learning

J García, R Iglesias, MA Rodríguez… - International Journal of …, 2019 - World Scientific
Usually, real-world problems involve the optimization of multiple, possibly conflicting,
objectives. These problems may be addressed by Multi-objective Reinforcement learning …

Towards a unified framework of efficient algorithms for numerical optimal robot control

M Giftthaler - 2018 - research-collection.ethz.ch
In the recent past, the wide availability of digital technologies has strongly disrupted many
well-established manufacturing techniques and initiated a rapid transformation process in …

[PDF][PDF] Planning and control for agile quadruped robots

C Gehring - 2017 - research-collection.ethz.ch
Abstract Machines have facilitated our lives since centuries. The next generation of mobile
helpers may have the abilities and agility of legged animals and can thus support us in …