- Academic Search

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier

This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

Opslaan Citeren Geciteerd door 379 Verwante artikelen Alle 5 versies

[Free GPT-4]
[DeepSeek]

[PDF] mdpi.com

A review of deep reinforcement learning approaches for smart manufacturing in industry 4.0 and 5.0 framework

A del Real Torres, DS Andreiana, Á Ojeda Roldán… - Applied Sciences, 2022 - mdpi.com

In this review, the industry's current issues regarding intelligent manufacture are presented.
This work presents the status and the potential for the I4. 0 and I5. 0's revolutionary …

Opslaan Citeren Geciteerd door 57 Verwante artikelen Alle 5 versies In cache

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deep reinforcement learning for autonomous driving: A survey

BR Kiran, I Sobh, V Talpaert, P Mannion… - IEEE Transactions …, 2021 - ieeexplore.ieee.org

With the development of deep representation learning, the domain of reinforcement learning
(RL) has become a powerful learning framework now capable of learning complex policies …

Opslaan Citeren Geciteerd door 2289 Verwante artikelen Alle 10 versies

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Cooperative exploration for multi-agent deep reinforcement learning

IJ Liu, U Jain, RA Yeh… - … conference on machine …, 2021 - proceedings.mlr.press

Exploration is critical for good results in deep reinforcement learning and has attracted much
attention. However, existing multi-agent deep reinforcement learning algorithms still use …

Opslaan Citeren Geciteerd door 125 Verwante artikelen Alle 9 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of deep RL and IL for autonomous driving policy learning

Z Zhu, H Zhao - IEEE Transactions on Intelligent Transportation …, 2021 - ieeexplore.ieee.org

Autonomous driving (AD) agents generate driving policies based on online perception
results, which are obtained at multiple levels of abstraction, eg, behavior planning, motion …

Opslaan Citeren Geciteerd door 193 Verwante artikelen Alle 5 versies

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Winner takes it all: Training performant RL populations for combinatorial optimization

N Grinsztajn, D Furelos-Blanco… - Advances in …, 2023 - proceedings.neurips.cc

Applying reinforcement learning (RL) to combinatorial optimization problems is attractive as
it removes the need for expert knowledge or pre-solved instances. However, it is unrealistic …

Opslaan Citeren Geciteerd door 29 Verwante artikelen Alle 3 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hierarchical reinforcement learning for air-to-air combat

AP Pope, JS Ide, D Mićović, H Diaz… - 2021 international …, 2021 - ieeexplore.ieee.org

Artificial Intelligence (AI) is becoming a critical component in the defense industry, as
recently demonstrated by DARPA's AlphaDogfight Trials (ADT). ADT sought to vet the …

Opslaan Citeren Geciteerd door 139 Verwante artikelen Alle 5 versies

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Trajectory diversity for zero-shot coordination

A Lupu, B Cui, H Hu, J Foerster - … conference on machine …, 2021 - proceedings.mlr.press

We study the problem of zero-shot coordination (ZSC), where agents must independently
produce strategies for a collaborative game that are compatible with novel partners not seen …

Opslaan Citeren Geciteerd door 112 Verwante artikelen Alle 7 versies In bibliotheek zoeken HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Effective diversity in population based reinforcement learning

J Parker-Holder, A Pacchiano… - Advances in …, 2020 - proceedings.neurips.cc

Exploration is a key problem in reinforcement learning, since agents can only learn from
data they acquire in the environment. With that in mind, maintaining a population of agents is …

Opslaan Citeren Geciteerd door 173 Verwante artikelen Alle 8 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Policy space diversity for non-transitive games

J Yao, W Liu, H Fu, Y Yang… - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract Policy-Space Response Oracles (PSRO) is an influential algorithm framework for
approximating a Nash Equilibrium (NE) in multi-agent non-transitive games. Many previous …

Opslaan Citeren Geciteerd door 13 Verwante artikelen Alle 5 versies HTML-versie

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Diversity-driven exploration strategy for deep reinforcement learning

Exploration in deep reinforcement learning: A survey

A review of deep reinforcement learning approaches for smart manufacturing in industry 4.0 and 5.0 framework

Deep reinforcement learning for autonomous driving: A survey

Cooperative exploration for multi-agent deep reinforcement learning

A survey of deep RL and IL for autonomous driving policy learning

Winner takes it all: Training performant RL populations for combinatorial optimization

Hierarchical reinforcement learning for air-to-air combat

Trajectory diversity for zero-shot coordination

Effective diversity in population based reinforcement learning

Policy space diversity for non-transitive games