Reinforcement learning in healthcare: A survey
As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …
making by using interaction samples of an agent with its environment and the potentially …
Reinforcement learning in robotic applications: a comprehensive survey
In recent trends, artificial intelligence (AI) is used for the creation of complex automated
control systems. Still, researchers are trying to make a completely autonomous system that …
control systems. Still, researchers are trying to make a completely autonomous system that …
Advantage-weighted regression: Simple and scalable off-policy reinforcement learning
In this paper, we aim to develop a simple and scalable reinforcement learning algorithm that
uses standard supervised learning methods as subroutines. Our goal is an algorithm that …
uses standard supervised learning methods as subroutines. Our goal is an algorithm that …
Deep reinforcement learning in a handful of trials using probabilistic dynamics models
Abstract Model-based reinforcement learning (RL) algorithms can attain excellent sample
efficiency, but often lag behind the best model-free algorithms in terms of asymptotic …
efficiency, but often lag behind the best model-free algorithms in terms of asymptotic …
Learning complex dexterous manipulation with deep reinforcement learning and demonstrations
Dexterous multi-fingered hands are extremely versatile and provide a generic way to
perform a multitude of tasks in human-centric environments. However, effectively controlling …
perform a multitude of tasks in human-centric environments. However, effectively controlling …
Overcoming exploration in reinforcement learning with demonstrations
Exploration in environments with sparse rewards has been a persistent problem in
reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and …
reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and …