Normalization techniques in training dnns: Methodology, analysis and application

L Huang, J Qin, Y Zhou, F Zhu, L Liu… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Normalization techniques are essential for accelerating the training and improving the
generalization of deep neural networks (DNNs), and have successfully been used in various …

The primacy bias in deep reinforcement learning

E Nikishin, M Schwarzer, P D'Oro… - International …, 2022 - proceedings.mlr.press
This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …

RIS-assisted UAV for fresh data collection in 3D urban environments: A deep reinforcement learning approach

X Fan, M Liu, Y Chen, S Sun, Z Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Dispatching flexible unmanned aerial vehicles (UAVs) to collect data from distributed
Internet-of-Things devices (IoTDs) is expected to be a promising technology to support time …

Bail: Best-action imitation learning for batch deep reinforcement learning

X Chen, Z Zhou, Z Wang, C Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc
There has recently been a surge in research in batch Deep Reinforcement Learning (DRL),
which aims for learning a high-performing policy from a given dataset without additional …

An improved soft actor-critic-based energy management strategy of heavy-duty hybrid electric vehicles with dual-engine system

D Zhang, W Sun, Y Zou, X Zhang, Y Zhang - Energy, 2024 - Elsevier
While deep reinforcement learning (DRL) based energy management strategies (EMSs)
have shown potential for optimizing energy utilization in recent years, challenges such as …

An improved two-stage deep reinforcement learning approach for regulation service disaggregation in a virtual power plant

Z Yi, Y Xu, X Wang, W Gu, H Sun… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Managing numerous distributed energy resources (DERs) within the virtual power plant
(VPP) is challenging due to inaccurate parameters and unknown dynamic characteristics. To …

Vrl3: A data-driven framework for visual deep reinforcement learning

C Wang, X Luo, K Ross, D Li - Advances in Neural …, 2022 - proceedings.neurips.cc
We propose VRL3, a powerful data-driven framework with a simple design for solving
challenging visual deep reinforcement learning (DRL) tasks. We analyze a number of major …

Is bang-bang control all you need? solving continuous control with bernoulli policies

T Seyde, I Gilitschenski, W Schwarting… - Advances in …, 2021 - proceedings.neurips.cc
Reinforcement learning (RL) for continuous control typically employs distributions whose
support covers the entire action space. In this work, we investigate the colloquially known …

An equivalence between loss functions and non-uniform sampling in experience replay

S Fujimoto, D Meger, D Precup - Advances in neural …, 2020 - proceedings.neurips.cc
Abstract Prioritized Experience Replay (PER) is a deep reinforcement learning technique in
which agents learn from transitions sampled with non-uniform probability proportionate to …

Precise atom manipulation through deep reinforcement learning

IJ Chen, M Aapro, A Kipnis, A Ilin, P Liljeroth… - Nature …, 2022 - nature.com
Atomic-scale manipulation in scanning tunneling microscopy has enabled the creation of
quantum states of matter based on artificial structures and extreme miniaturization of …