How to train your robot with deep reinforcement learning: lessons we have learned

J Ibarz, J Tan, C Finn, M Kalakrishnan… - … Journal of Robotics …, 2021 - journals.sagepub.com
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …

A review of robot learning for manipulation: Challenges, representations, and algorithms

O Kroemer, S Niekum, G Konidaris - Journal of machine learning research, 2021 - jmlr.org
A key challenge in intelligent robotics is creating robots that are capable of directly
interacting with the world around them to achieve their goals. The last decade has seen …

Combining model-based and model-free updates for trajectory-centric reinforcement learning

Y Chebotar, K Hausman, M Zhang… - International …, 2017 - proceedings.mlr.press
Reinforcement learning algorithms for real-world robotic applications must be able to handle
complex, unknown dynamical systems while maintaining data-efficient learning. These …

Hierarchical relative entropy policy search

C Daniel, G Neumann, O Kroemer, J Peters - Journal of Machine Learning …, 2016 - jmlr.org
Many reinforcement learning (RL) tasks, especially in robotics, consist of multiple sub-tasks
that are strongly structured. Such task structures can be exploited by incorporating …

Online multi-task learning for policy gradient methods

HB Ammar, E Eaton, P Ruvolo… - … conference on machine …, 2014 - proceedings.mlr.press
Policy gradient algorithms have shown considerable recent success in solving high-
dimensional sequential decision making tasks, particularly in robotics. However, these …

Probabilistic inference for determining options in reinforcement learning

C Daniel, H Van Hoof, J Peters, G Neumann - Machine Learning, 2016 - Springer
Tasks that require many sequential decisions or complex solutions are hard to solve using
conventional reinforcement learning algorithms. Based on the semi Markov decision …

Towards learning hierarchical skills for multi-phase manipulation tasks

O Kroemer, C Daniel, G Neumann… - … on robotics and …, 2015 - ieeexplore.ieee.org
Most manipulation tasks can be decomposed into a sequence of phases, where the robot's
actions have different effects in each phase. The robot can perform actions to transition …

[PDF][PDF] Active Reward Learning.

C Daniel, M Viering, J Metz… - Robotics: Science …, 2014 - ias.informatik.tu-darmstadt.de
While reward functions are an essential component of many robot learning methods,
defining such functions remains a hard problem in many practical applications. For tasks …

Learning generalizable locomotion skills with hierarchical reinforcement learning

T Li, N Lambert, R Calandra, F Meier… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Learning to locomote to arbitrary goals on hardware remains a challenging problem for
reinforcement learning. In this paper, we present a hierarchical framework that improves …

Learning step size controllers for robust neural network training

C Daniel, J Taylor, S Nowozin - … of the AAAI Conference on Artificial …, 2016 - ojs.aaai.org
This paper investigates algorithms to automatically adapt the learning rate of neural
networks (NNs). Starting with stochastic gradient descent, a large variety of learning …