Google Академія

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Зберегти Послатися Цитовано в 128 джерелах Пов’язані статті Кількість версій: 9

[Free GPT-4]
[DeepSeek]

[PDF] aaai.org

Task phasing: Automated curriculum learning from demonstrations

V Bajaj, G Sharon, P Stone - Proceedings of the International …, 2023 - ojs.aaai.org

Applying reinforcement learning (RL) to sparse reward domains is notoriously challenging
due to insufficient guiding signals. Common RL techniques for addressing such domains …

Зберегти Послатися Цитовано в 6 джерелах Пов’язані статті Кількість версій: 12 Показати у форматі HTML

Entropy regularization methods for parameter space exploration

S Han, W Zhou, S Lü, S Zhu, X Gong - Information Sciences, 2023 - Elsevier

Entropy regularization is an important approach to improve exploration and enhance policy
stability for reinforcement learning. However, in previous study, entropy regularization is …

Зберегти Послатися Цитовано в 5 джерелах Пов’язані статті Кількість версій: 2

Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security

PG Ye, J Zheng, X Ren, J Huang, Z Zhang, Y Pang… - Information …, 2024 - Elsevier

The deployment of unmanned aerial vehicles (UAVs) in ultra-dense networks (UNDs) has
significantly advanced network capabilities in 5G/6G environments, addressing coverage …

Зберегти Послатися Пов’язані статті Кількість версій: 4

[Free GPT-4]
[DeepSeek]

[PDF] google.com

Mnemonic Dictionary Learning for Intrinsic Motivation in Reinforcement Learning

R Yan, Z Wu, Y Zhan, P Tao, Z Wang… - … Joint Conference on …, 2023 - ieeexplore.ieee.org

Reinforcement learning for hard-exploration tasks remains challenging due to the long-term
dependence and sparse-and-delay rewards in complex environments. In these challenging …

Зберегти Послатися Цитовано в 1 джерелах Пов’язані статті Кількість версій: 3

Autoencoder Reconstruction Model for Long-Horizon Exploration

R Yan, Y Wu, Y Gan, Y Yang, Z Yu, Z Liu… - … Joint Conference on …, 2024 - ieeexplore.ieee.org

Conventional reinforcement learning (RL) algorithms often necessitate millions of
environment interactions to ascertain an efficacious policy. In stark contrast, humans …

Зберегти Послатися Пов’язані статті Кількість версій: 2

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Generative subgoal oriented multi-agent reinforcement learning through potential field

S Li, H Jiang, Y Liu, J Zhang, X Xu, D Liu - Neural Networks, 2024 - Elsevier

Multi-agent reinforcement learning (MARL) effectively improves the learning speed of agents
in sparse reward tasks with the guide of subgoals. However, existing works sever the …

Зберегти Послатися Пов’язані статті Кількість версій: 5

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Potential driven reinforcement learning for hard exploration tasks

Exploration in deep reinforcement learning: From single-agent to multiagent domain

Task phasing: Automated curriculum learning from demonstrations

Entropy regularization methods for parameter space exploration

Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security

Mnemonic Dictionary Learning for Intrinsic Motivation in Reinforcement Learning

Autoencoder Reconstruction Model for Long-Horizon Exploration

[HTML][HTML] Generative subgoal oriented multi-agent reinforcement learning through potential field