Deep reinforcement learning for robotics: A survey of real-world successes
Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …
[HTML][HTML] Deep Learning applications for COVID-19
This survey explores how Deep Learning has battled the COVID-19 pandemic and provides
directions for future research on COVID-19. We cover Deep Learning applications in Natural …
directions for future research on COVID-19. We cover Deep Learning applications in Natural …
Ai alignment: A comprehensive survey
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
AI systems grow more capable, the potential large-scale risks associated with misaligned AI …
Human-to-robot imitation in the wild
We approach the problem of learning by watching humans in the wild. While traditional
approaches in Imitation and Reinforcement Learning are promising for learning in the real …
approaches in Imitation and Reinforcement Learning are promising for learning in the real …
Few-shot preference learning for human-in-the-loop rl
While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …
designing sufficiently informative reward functions for complex tasks has proven to be …
Inverse preference learning: Preference-based rl without a reward function
Reward functions are difficult to design and often hard to align with human intent. Preference-
based Reinforcement Learning (RL) algorithms address these problems by learning reward …
based Reinforcement Learning (RL) algorithms address these problems by learning reward …
Global optimality guarantees for policy gradient methods
Policy gradients methods apply to complex, poorly understood, control problems by
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …
Hierarchical reinforcement learning: A survey and open research challenges
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …
by interacting with an environment in a trial-and-error fashion. When these environments are …
Genloco: Generalized locomotion controllers for quadrupedal robots
Recent years have seen a surge in commercially-available and affordable quadrupedal
robots, with many of these platforms being actively used in research and industry. As the …
robots, with many of these platforms being actively used in research and industry. As the …
Model-free reinforcement learning from expert demonstrations: a survey
Reinforcement learning from expert demonstrations (RLED) is the intersection of imitation
learning with reinforcement learning that seeks to take advantage of these two learning …
learning with reinforcement learning that seeks to take advantage of these two learning …