Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Transfer learning in deep reinforcement learning: A survey
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …
problems. Recent years have witnessed remarkable progress in reinforcement learning …
Autonomous navigation of UAVs in large-scale complex environments: A deep reinforcement learning approach
In this paper, we propose a deep reinforcement learning (DRL)-based method that allows
unmanned aerial vehicles (UAVs) to execute navigation tasks in large-scale complex …
unmanned aerial vehicles (UAVs) to execute navigation tasks in large-scale complex …
Learning to utilize sha** rewards: A new approach of reward sha**
Reward sha** is an effective technique for incorporating domain knowledge into
reinforcement learning (RL). Existing approaches such as potential-based reward sha** …
reinforcement learning (RL). Existing approaches such as potential-based reward sha** …
Rudder: Return decomposition for delayed rewards
We propose RUDDER, a novel reinforcement learning approach for delayed rewards in
finite Markov decision processes (MDPs). In MDPs the Q-values are equal to the expected …
finite Markov decision processes (MDPs). In MDPs the Q-values are equal to the expected …
Deep reinforcement learning and reward sha** based eco-driving control for automated HEVs among signalized intersections
J Li, X Wu, M Xu, Y Liu - Energy, 2022 - Elsevier
In a connected traffic environment with signalized intersections, eco-driving control needs to
co-optimize fuel economy (fuel consumption), driving safety (collisions and red lights), and …
co-optimize fuel economy (fuel consumption), driving safety (collisions and red lights), and …
Human-centered reinforcement learning: A survey
Human-centered reinforcement learning (RL), in which an agent learns how to perform a
task from evaluative feedback delivered by a human observer, has become more and more …
task from evaluative feedback delivered by a human observer, has become more and more …
Reward sha** to improve the performance of deep reinforcement learning in perishable inventory management
Deep reinforcement learning (DRL) has proven to be an effective, general-purpose
technology to develop 'good'replenishment policies in inventory management. We show …
technology to develop 'good'replenishment policies in inventory management. We show …
State abstractions for lifelong reinforcement learning
In lifelong reinforcement learning, agents must effectively transfer knowledge across tasks
while simultaneously addressing exploration, credit assignment, and generalization. State …
while simultaneously addressing exploration, credit assignment, and generalization. State …
Comprehensive overview of reward engineering and sha** in advancing reinforcement learning applications
Reinforcement Learning (RL) seeks to develop systems capable of autonomous decision-
making by learning through interaction with their environment. Central to this process are …
making by learning through interaction with their environment. Central to this process are …
Explicable reward design for reinforcement learning agents
We study the design of explicable reward functions for a reinforcement learning agent while
guaranteeing that an optimal policy induced by the function belongs to a set of target …
guaranteeing that an optimal policy induced by the function belongs to a set of target …