Scalable and cooperative deep reinforcement learning approaches for multi-UAV systems: A systematic review
In recent years, the use of multiple unmanned aerial vehicles (UAVs) in various applications
has progressively increased thanks to advancements in multi-agent system technology …
has progressively increased thanks to advancements in multi-agent system technology …
Posterior sampling with delayed feedback for reinforcement learning with linear function approximation
Recent studies in reinforcement learning (RL) have made significant progress by leveraging
function approximation to alleviate the sample complexity hurdle for better performance …
function approximation to alleviate the sample complexity hurdle for better performance …
Belief projection-based reinforcement learning for environments with delayed feedback
We present a novel actor-critic algorithm for an environment with delayed feedback, which
addresses the state-space explosion problem of conventional approaches. Conventional …
addresses the state-space explosion problem of conventional approaches. Conventional …
Efficient rl with impaired observability: Learning to act with delayed and missing state observations
In real-world reinforcement learning (RL) systems, various forms of {\it impaired
observability} can complicate matters. These situations arise when an agent is unable to …
observability} can complicate matters. These situations arise when an agent is unable to …
Neural Laplace control for continuous-time delayed systems
Many real-world offline reinforcement learning (RL) problems involve continuous-time
environments with delays. Such environments are characterized by two distinctive features …
environments with delays. Such environments are characterized by two distinctive features …
Realtime reinforcement learning: Towards rapid asynchronous deployment of large models
Realtime environments change even as agents perform action inference and learning, thus
requiring high interaction frequencies to effectively minimize long-term regret. However …
requiring high interaction frequencies to effectively minimize long-term regret. However …
Finite-set direct torque control via edge computing-assisted safe reinforcement learning for a permanent magnet synchronous motor
Advances in the field of reinforcement learning (RL)-based drive control allow formulation of
holistic optimization goals for the data-driven training phase. The resulting controllers …
holistic optimization goals for the data-driven training phase. The resulting controllers …
Revisiting state augmentation methods for reinforcement learning with stochastic delays
Several real-world scenarios, such as remote control and sensing, are comprised of action
and observation delays. The presence of delays degrades the performance of reinforcement …
and observation delays. The presence of delays degrades the performance of reinforcement …
Delayed reinforcement learning by imitation
When the agent's observations or interactions are delayed, classic reinforcement learning
tools usually fail. In this paper, we propose a simple yet new and efficient solution to this …
tools usually fail. In this paper, we propose a simple yet new and efficient solution to this …
Scalable reinforcement learning framework for traffic signal control under communication delays
Vehicle-to-everything (V2X) technology is pivotal for enhancing road safety, traffic efficiency,
and energy conservation through the communication of vehicles with their surrounding …
and energy conservation through the communication of vehicles with their surrounding …