Reinforcement learning in robotic applications: a comprehensive survey

B Singh, R Kumar, VP Singh - Artificial Intelligence Review, 2022 - Springer
In recent trends, artificial intelligence (AI) is used for the creation of complex automated
control systems. Still, researchers are trying to make a completely autonomous system that …

A survey on deep learning and deep reinforcement learning in robotics with a tutorial on deep reinforcement learning

EF Morales, R Murrieta-Cid, I Becerra… - Intelligent Service …, 2021 - Springer
This article is about deep learning (DL) and deep reinforcement learning (DRL) works
applied to robotics. Both tools have been shown to be successful in delivering data-driven …

Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

Interactive imitation learning in robotics: A survey

C Celemin, R Pérez-Dattari, E Chisari… - … and Trends® in …, 2022 - nowpublishers.com
Interactive Imitation Learning in Robotics: A Survey Page 1 Interactive Imitation Learning in
Robotics: A Survey Page 2 Other titles in Foundations and Trends® in Robotics A Survey on …

Interactive learning from policy-dependent human feedback

J MacGlashan, MK Ho, R Loftin… - International …, 2017 - proceedings.mlr.press
This paper investigates the problem of interactively learning behaviors communicated by a
human teacher using positive and negative feedback. Much previous work on this problem …

Policy sha**: Integrating human feedback with reinforcement learning

S Griffith, K Subramanian, J Scholz… - Advances in neural …, 2013 - proceedings.neurips.cc
A long term goal of Interactive Reinforcement Learning is to incorporate non-expert human
feedback to solve complex tasks. State-of-the-art methods have approached this problem by …

Proximal policy optimization with policy feedback

Y Gu, Y Cheng, CLP Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Proximal policy optimization (PPO) is a deep reinforcement learning algorithm based on the
actor–critic (AC) architecture. In the classic AC architecture, the Critic (value) network is used …

[KNIHA][B] Robot learning from human teachers

S Chernova, AL Thomaz - 2022 - books.google.com
Learning from Demonstration (LfD) explores techniques for learning a task policy from
examples provided by a human teacher. The field of LfD has grown into an extensive body …

Human-centered reinforcement learning: A survey

G Li, R Gomez, K Nakamura… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Human-centered reinforcement learning (RL), in which an agent learns how to perform a
task from evaluative feedback delivered by a human observer, has become more and more …

A review on interactive reinforcement learning from human social feedback

J Lin, Z Ma, R Gomez, K Nakamura, B He, G Li - IEEE Access, 2020 - ieeexplore.ieee.org
Reinforcement learning agent learns how to perform a task by interacting with the
environment. The use of reinforcement learning in real-life applications has been limited …