Scalable and cooperative deep reinforcement learning approaches for multi-UAV systems: A systematic review

F Frattolillo, D Brunori, L Iocchi - Drones, 2023 - mdpi.com
In recent years, the use of multiple unmanned aerial vehicles (UAVs) in various applications
has progressively increased thanks to advancements in multi-agent system technology …

Posterior sampling with delayed feedback for reinforcement learning with linear function approximation

NL Kuang, M Yin, M Wang… - Advances in Neural …, 2023 - proceedings.neurips.cc
Recent studies in reinforcement learning (RL) have made significant progress by leveraging
function approximation to alleviate the sample complexity hurdle for better performance …

Belief projection-based reinforcement learning for environments with delayed feedback

J Kim, H Kim, J Kang, J Baek… - Advances in Neural …, 2023 - proceedings.neurips.cc
We present a novel actor-critic algorithm for an environment with delayed feedback, which
addresses the state-space explosion problem of conventional approaches. Conventional …

Efficient rl with impaired observability: Learning to act with delayed and missing state observations

M Chen, Y Bai, HV Poor… - Advances in Neural …, 2024 - proceedings.neurips.cc
In real-world reinforcement learning (RL) systems, various forms of {\it impaired
observability} can complicate matters. These situations arise when an agent is unable to …

Neural Laplace control for continuous-time delayed systems

S Holt, A Hüyük, Z Qian, H Sun… - International …, 2023 - proceedings.mlr.press
Many real-world offline reinforcement learning (RL) problems involve continuous-time
environments with delays. Such environments are characterized by two distinctive features …

Realtime reinforcement learning: Towards rapid asynchronous deployment of large models

M Riemer, G Subbaraj, G Berseth… - The Thirteenth International …, 2024 - openreview.net
Realtime environments change even as agents perform action inference and learning, thus
requiring high interaction frequencies to effectively minimize long-term regret. However …

Finite-set direct torque control via edge computing-assisted safe reinforcement learning for a permanent magnet synchronous motor

M Schenke, B Haucke-Korber… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Advances in the field of reinforcement learning (RL)-based drive control allow formulation of
holistic optimization goals for the data-driven training phase. The resulting controllers …

Revisiting state augmentation methods for reinforcement learning with stochastic delays

S Nath, M Baranwal, H Khadilkar - Proceedings of the 30th ACM …, 2021 - dl.acm.org
Several real-world scenarios, such as remote control and sensing, are comprised of action
and observation delays. The presence of delays degrades the performance of reinforcement …

Delayed reinforcement learning by imitation

P Liotet, D Maran, L Bisi… - … Conference on Machine …, 2022 - proceedings.mlr.press
When the agent's observations or interactions are delayed, classic reinforcement learning
tools usually fail. In this paper, we propose a simple yet new and efficient solution to this …

Scalable reinforcement learning framework for traffic signal control under communication delays

A Pang, M Wang, Y Chen, MO Pun… - IEEE Open Journal of …, 2024 - ieeexplore.ieee.org
Vehicle-to-everything (V2X) technology is pivotal for enhancing road safety, traffic efficiency,
and energy conservation through the communication of vehicles with their surrounding …