Continuous control with deep reinforcement learning

TP Lillicrap, JJ Hunt, A Pritzel, N Heess, T Erez… - arxiv preprint arxiv …, 2015 - arxiv.org
We adapt the ideas underlying the success of Deep Q-Learning to the continuous action
domain. We present an actor-critic, model-free algorithm based on the deterministic policy …

Multi-agent deep reinforcement learning based transmission latency minimization for delay-sensitive cognitive satellite-UAV networks

S Guo, X Zhao - IEEE Transactions on Communications, 2022 - ieeexplore.ieee.org
With the ubiquitous deployment of a massive number of Internet-of-Things (IoT) devices, the
satellite-aerial networks are becoming a promising candidate to provide flexible and …

Learning locomotion skills using deeprl: Does the choice of action space matter?

XB Peng, M Van De Panne - Proceedings of the ACM SIGGRAPH …, 2017 - dl.acm.org
The use of deep reinforcement learning allows for high-dimensional state descriptors, but
little is known about how the choice of action representation impacts learning and the …

A systematic method for the optimization of gas supply reliability in natural gas pipeline network based on Bayesian networks and deep reinforcement learning

L Fan, H Su, W Wang, E Zio, L Zhang, Z Yang… - Reliability Engineering & …, 2022 - Elsevier
This study proposes a method based on Bayesian networks (BNs) to optimize the reliability
of gas supply in natural gas pipeline networks. The method integrates probabilistic safety …

Jointly learning to construct and control agents using deep reinforcement learning

C Schaff, D Yunis, A Chakrabarti… - … conference on robotics …, 2019 - ieeexplore.ieee.org
The physical design of a robot and the policy that controls its motion are inherently coupled,
and should be determined according to the task and environment. In an increasing number …

Toward end-to-end control for UAV autonomous landing via deep reinforcement learning

R Polvara, M Patacchiola, S Sharma… - 2018 International …, 2018 - ieeexplore.ieee.org
The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem.
Previous work focused on the use of hand-crafted geometric features and sensor-data fusion …

Deep reinforcement learning optimal transmission algorithm for cognitive Internet of Things with RF energy harvesting

S Guo, X Zhao - IEEE Transactions on Cognitive …, 2022 - ieeexplore.ieee.org
Spectrum scarcity and energy limitation are becoming two critical issues in designing
Internet of Things (IoT). As two promising technologies, cognitive radio (CR) and radio …

A control strategy of autonomous vehicles based on deep reinforcement learning

W **a, H Li, B Li - 2016 9th International Symposium on …, 2016 - ieeexplore.ieee.org
Deep reinforcement learning has received considerable attention after the outstanding
performance of AlphaGo. In this paper, we propose a new control strategy of self-driving …

Throughput maximization for RF powered cognitive noma networks with backscatter communication by deep reinforcement learning

S Guo, X Zhao, W Zhang - IEEE Transactions on Wireless …, 2023 - ieeexplore.ieee.org
In this paper, we present a hybrid ambient backscatter communication (ABC) assisted
framework for radio frequency (RF) powered cognitive radio networks (CRNs). In these …

[HTML][HTML] Sampled-data control through model-free reinforcement learning with effective experience replay

B **ao, HK Lam, X Su, Z Wang, FPW Lo, S Chen… - Journal of Automation …, 2023 - Elsevier
Reinforcement Learning (RL) based control algorithms can learn the control strategies for
nonlinear and uncertain environment during interacting with it. Guided by the rewards …