- Academic Search

TP Lillicrap, JJ Hunt, A Pritzel, N Heess, T Erez… - arxiv preprint arxiv …, 2015 - arxiv.org

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action
domain. We present an actor-critic, model-free algorithm based on the deterministic policy …

Opslaan Citeren Geciteerd door 18612 Verwante artikelen Alle 14 versies HTML-versie

Multi-agent deep reinforcement learning based transmission latency minimization for delay-sensitive cognitive satellite-UAV networks

S Guo, X Zhao - IEEE Transactions on Communications, 2022 - ieeexplore.ieee.org

With the ubiquitous deployment of a massive number of Internet-of-Things (IoT) devices, the
satellite-aerial networks are becoming a promising candidate to provide flexible and …

Opslaan Citeren Geciteerd door 55 Verwante artikelen

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Learning locomotion skills using deeprl: Does the choice of action space matter?

XB Peng, M Van De Panne - Proceedings of the ACM SIGGRAPH …, 2017 - dl.acm.org

The use of deep reinforcement learning allows for high-dimensional state descriptors, but
little is known about how the choice of action representation impacts learning and the …

Opslaan Citeren Geciteerd door 227 Verwante artikelen Alle 9 versies

A systematic method for the optimization of gas supply reliability in natural gas pipeline network based on Bayesian networks and deep reinforcement learning

L Fan, H Su, W Wang, E Zio, L Zhang, Z Yang… - Reliability Engineering & …, 2022 - Elsevier

This study proposes a method based on Bayesian networks (BNs) to optimize the reliability
of gas supply in natural gas pipeline networks. The method integrates probabilistic safety …

Opslaan Citeren Geciteerd door 51 Verwante artikelen Alle 7 versies

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Jointly learning to construct and control agents using deep reinforcement learning

C Schaff, D Yunis, A Chakrabarti… - … conference on robotics …, 2019 - ieeexplore.ieee.org

The physical design of a robot and the policy that controls its motion are inherently coupled,
and should be determined according to the task and environment. In an increasing number …

Opslaan Citeren Geciteerd door 130 Verwante artikelen Alle 11 versies

[Free GPT-4]
[DeepSeek]

[PDF] plymouth.ac.uk

Toward end-to-end control for UAV autonomous landing via deep reinforcement learning

R Polvara, M Patacchiola, S Sharma… - 2018 International …, 2018 - ieeexplore.ieee.org

The autonomous landing of an unmanned aerial vehicle (UAV) is still an open problem.
Previous work focused on the use of hand-crafted geometric features and sensor-data fusion …

Opslaan Citeren Geciteerd door 118 Verwante artikelen Alle 8 versies

Deep reinforcement learning optimal transmission algorithm for cognitive Internet of Things with RF energy harvesting

S Guo, X Zhao - IEEE Transactions on Cognitive …, 2022 - ieeexplore.ieee.org

Spectrum scarcity and energy limitation are becoming two critical issues in designing
Internet of Things (IoT). As two promising technologies, cognitive radio (CR) and radio …

Opslaan Citeren Geciteerd door 40 Verwante artikelen Alle 2 versies

A control strategy of autonomous vehicles based on deep reinforcement learning

W **a, H Li, B Li - 2016 9th International Symposium on …, 2016 - ieeexplore.ieee.org

Deep reinforcement learning has received considerable attention after the outstanding
performance of AlphaGo. In this paper, we propose a new control strategy of self-driving …

Opslaan Citeren Geciteerd door 102 Verwante artikelen Alle 2 versies

Throughput maximization for RF powered cognitive noma networks with backscatter communication by deep reinforcement learning

S Guo, X Zhao, W Zhang - IEEE Transactions on Wireless …, 2023 - ieeexplore.ieee.org

In this paper, we present a hybrid ambient backscatter communication (ABC) assisted
framework for radio frequency (RF) powered cognitive radio networks (CRNs). In these …

Opslaan Citeren Geciteerd door 12 Verwante artikelen Alle 2 versies

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Sampled-data control through model-free reinforcement learning with effective experience replay

B **ao, HK Lam, X Su, Z Wang, FPW Lo, S Chen… - Journal of Automation …, 2023 - Elsevier

Reinforcement Learning (RL) based control algorithms can learn the control strategies for
nonlinear and uncertain environment during interacting with it. Guided by the rewards …

Opslaan Citeren Geciteerd door 22 Verwante artikelen Alle 2 versies

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Autonomous reinforcement learning with experience replay

Continuous control with deep reinforcement learning

Multi-agent deep reinforcement learning based transmission latency minimization for delay-sensitive cognitive satellite-UAV networks

Learning locomotion skills using deeprl: Does the choice of action space matter?

A systematic method for the optimization of gas supply reliability in natural gas pipeline network based on Bayesian networks and deep reinforcement learning

Jointly learning to construct and control agents using deep reinforcement learning

Toward end-to-end control for UAV autonomous landing via deep reinforcement learning

Deep reinforcement learning optimal transmission algorithm for cognitive Internet of Things with RF energy harvesting

A control strategy of autonomous vehicles based on deep reinforcement learning

Throughput maximization for RF powered cognitive noma networks with backscatter communication by deep reinforcement learning

[HTML][HTML] Sampled-data control through model-free reinforcement learning with effective experience replay