Transfer learning in deep reinforcement learning: A survey

Z Zhu, K Lin, AK Jain, J Zhou - IEEE Transactions on Pattern …, 2023 - ieeexplore.ieee.org
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …

Autonomous navigation of UAVs in large-scale complex environments: A deep reinforcement learning approach

C Wang, J Wang, Y Shen… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
In this paper, we propose a deep reinforcement learning (DRL)-based method that allows
unmanned aerial vehicles (UAVs) to execute navigation tasks in large-scale complex …

Learning to utilize sha** rewards: A new approach of reward sha**

Y Hu, W Wang, H Jia, Y Wang… - Advances in …, 2020 - proceedings.neurips.cc
Reward sha** is an effective technique for incorporating domain knowledge into
reinforcement learning (RL). Existing approaches such as potential-based reward sha** …

Rudder: Return decomposition for delayed rewards

JA Arjona-Medina, M Gillhofer… - Advances in …, 2019 - proceedings.neurips.cc
We propose RUDDER, a novel reinforcement learning approach for delayed rewards in
finite Markov decision processes (MDPs). In MDPs the Q-values are equal to the expected …

Deep reinforcement learning and reward sha** based eco-driving control for automated HEVs among signalized intersections

J Li, X Wu, M Xu, Y Liu - Energy, 2022 - Elsevier
In a connected traffic environment with signalized intersections, eco-driving control needs to
co-optimize fuel economy (fuel consumption), driving safety (collisions and red lights), and …

Human-centered reinforcement learning: A survey

G Li, R Gomez, K Nakamura… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Human-centered reinforcement learning (RL), in which an agent learns how to perform a
task from evaluative feedback delivered by a human observer, has become more and more …

Reward sha** to improve the performance of deep reinforcement learning in perishable inventory management

BJ De Moor, J Gijsbrechts, RN Boute - European Journal of Operational …, 2022 - Elsevier
Deep reinforcement learning (DRL) has proven to be an effective, general-purpose
technology to develop 'good'replenishment policies in inventory management. We show …

State abstractions for lifelong reinforcement learning

D Abel, D Arumugam, L Lehnert… - … on Machine Learning, 2018 - proceedings.mlr.press
In lifelong reinforcement learning, agents must effectively transfer knowledge across tasks
while simultaneously addressing exploration, credit assignment, and generalization. State …

Comprehensive overview of reward engineering and sha** in advancing reinforcement learning applications

S Ibrahim, M Mostafa, A Jnadi, H Salloum… - IEEE …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) seeks to develop systems capable of autonomous decision-
making by learning through interaction with their environment. Central to this process are …

Explicable reward design for reinforcement learning agents

R Devidze, G Radanovic… - Advances in neural …, 2021 - proceedings.neurips.cc
We study the design of explicable reward functions for a reinforcement learning agent while
guaranteeing that an optimal policy induced by the function belongs to a set of target …