Reinforcement learning techniques for optimal power control in grid-connected microgrids: A comprehensive review

EO Arwa, KA Folly - Ieee Access, 2020 - ieeexplore.ieee.org
Utility grids are undergoing several upgrades. Distributed generators that are supplied by
intermittent renewable energy sources (RES) are being connected to the grids. As RES get …

Deep reinforcement learning-based air combat maneuver decision-making: literature review, implementation tutorial and future direction

X Wang, Y Wang, X Su, L Wang, C Lu, H Peng… - Artificial Intelligence …, 2024 - Springer
Nowadays, various innovative air combat paradigms that rely on unmanned aerial vehicles
(UAVs), ie, UAV swarm and UAV-manned aircraft cooperation, have received great attention …

Strategies for controlling microgrid networks with energy storage systems: A review

M Al-Saadi, M Al-Greer, M Short - Energies, 2021 - mdpi.com
Distributed Energy Storage Systems are considered key enablers in the transition from the
traditional centralized power system to a smarter, autonomous, and decentralized system …

Self-contrastive Learning-optimized General Agent for long-tailed fault diagnosis of shipboard antennas leveraging adaptive data distribution

Q Cui, S He, C Hu, J Bao, Y Peng, J Chen - Measurement, 2025 - Elsevier
To address the challenges of low accuracy and limited generalization in long-tailed fault
diagnosis, an adaptive data distribution-based reinforcement learning General Agent is …

Mobile Robot Navigation Based on Noisy N-Step Dueling Double Deep Q-Network and Prioritized Experience Replay

W Hu, Y Zhou, HW Ho - Electronics, 2024 - mdpi.com
Effective real-time autonomous navigation for mobile robots in static and dynamic
environments has become a challenging and active research topic. Although the …

[HTML][HTML] Improved exploration–exploitation trade-off through adaptive prioritized experience replay

H Hassani, S Nikan, A Shami - Neurocomputing, 2025 - Elsevier
Experience replay is an indispensable part of deep reinforcement learning algorithms that
enables the agent to revisit and reuse its past and recent experiences to update the network …

Prioritized Generative Replay

R Wang, K Frans, P Abbeel, S Levine… - arxiv preprint arxiv …, 2024 - arxiv.org
Sample-efficient online reinforcement learning often uses replay buffers to store experience
for reuse when updating the value function. However, uniform replay is inefficient, since …

Prioritized experience replay based on multi-armed bandit

X Liu, T Zhu, C Jiang, D Ye, F Zhao - Expert Systems with Applications, 2022 - Elsevier
Experience replay has been widely used in deep reinforcement learning. The learning
algorithm allows online reinforcement learning agents to remember and reuse experiences …

Self-adaptive priority correction for prioritized experience replay

H Zhang, C Qu, J Zhang, J Li - Applied sciences, 2020 - mdpi.com
Deep Reinforcement Learning (DRL) is a promising approach for general artificial
intelligence. However, most DRL methods suffer from the problem of data inefficiency. To …

Deep reinforcement learning-based service-oriented resource allocation in smart grids

L **, Y Wang, Y Wang, Z Wang, X Wang, Y Chen - IEEE Access, 2021 - ieeexplore.ieee.org
Resource allocation has a direct and profound impact on the performance of resource-
limited smart grids with diversified services that need to be timely processed. In this paper …