Rlops: Development life-cycle of reinforcement learning aided open ran

P Li, J Thomas, X Wang, A Khalil, A Ahmad… - IEEE …, 2022 - ieeexplore.ieee.org
Radio access network (RAN) technologies continue to evolve, with Open RAN gaining the
most recent momentum. In the O-RAN specifications, the RAN intelligent controllers (RICs) …

Action noise in off-policy deep reinforcement learning: Impact on exploration and performance

J Hollenstein, S Auddy, M Saveriano… - arxiv preprint arxiv …, 2022 - arxiv.org
Many Deep Reinforcement Learning (D-RL) algorithms rely on simple forms of exploration
such as the additive action noise often used in continuous control domains. Typically, the …

[HTML][HTML] Decentralized control and local information for robust and adaptive decentralized deep reinforcement learning

M Schilling, A Melnik, FW Ohl, HJ Ritter, B Hammer - Neural Networks, 2021 - Elsevier
Decentralization is a central characteristic of biological motor control that allows for fast
responses relying on local sensory information. In contrast, the current trend of Deep …

Streaming Deep Reinforcement Learning Finally Works

M Elsayed, G Vasan, AR Mahmood - arxiv preprint arxiv:2410.14606, 2024 - arxiv.org
Natural intelligence processes experience as a continuous stream, sensing, acting, and
learning moment-by-moment in real time. Streaming learning, the modus operandi of classic …

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

G Vasan, M Elsayed, A Azimi, J He, F Shariar… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern deep policy gradient methods achieve effective performance on simulated robotic
tasks, but they all require large replay buffers or expensive batch updates, or both, making …

Learning vision-based reactive policies for obstacle avoidance

E Aljalbout, J Chen, K Ritt, M Ulmer… - Conference on Robot …, 2021 - proceedings.mlr.press
In this paper, we address the problem of vision-based obstacle avoidance for robotic
manipulators. This topic poses challenges for both perception and motion generation. While …

[PDF][PDF] Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization

S Sood, K Papasotiriou, M Vaiciulis… - FinPlan, 2023 - icaps23.icaps-conference.org
Portfolio Management is the process of overseeing a group of investments, referred to as a
portfolio, with the objective of achieving predetermined investment goals and objectives …

Reinforcement learning with formal performance metrics for quadcopter attitude control under non-nominal contexts

N Bernini, M Bessa, R Delmas, A Gold… - … Applications of Artificial …, 2024 - Elsevier
We explore the reinforcement learning approach to designing controllers by extensively
discussing the case of a quadcopter attitude controller. We provide all details allowing to …

Learning state and action spaces for robot learning

E Aljalbout - 2024 - mediatum.ub.tum.de
Robot learning is a powerful paradigm for skill acquisition in robotics. Learning such skills
based on data can be very expensive in terms of interactions with the environment …

Intelligent DRL-Based Adaptive Region of Interest for Delay-sensitive Telemedicine Applications

A Soliman, A Mohamed, E Yaacoub… - ICC 2023-IEEE …, 2023 - ieeexplore.ieee.org
Telemedicine applications have recently received substantial potential and interest,
especially after the COVID-19 pandemic. Remote experience will help people get their …