Google Académico

M Faldor, F Chalumeau, M Flageat, A Cully - arxiv preprint arxiv …, 2023 - arxiv.org

A hallmark of intelligence is the ability to exhibit a wide range of effective behaviors. Inspired
by this principle, Quality-Diversity algorithms, such as MAP-Elites, are evolutionary methods …

Guardar Citar Citado por 8 Artigos relacionados Todas as 5 versões Ver em HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Z Wang, B Wang, H **g, H Li, H Dou - arxiv preprint arxiv:2408.01880, 2024 - arxiv.org

Recent years, multi-hop reasoning has been widely studied for knowledge graph (KG)
reasoning due to its efficacy and interpretability. However, previous multi-hop reasoning …

Guardar Citar Citado por 1 Artigos relacionados Todas as 2 versões Ver em HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

The impact of intrinsic rewards on exploration in Reinforcement Learning

A Kayal, E Pignatelli, L Toni - arxiv preprint arxiv:2501.11533, 2025 - arxiv.org

One of the open challenges in Reinforcement Learning is the hard exploration problem in
sparse reward environments. Various types of intrinsic rewards have been proposed to …

Guardar Citar Artigos relacionados Todas as 2 versões Ver em HTML

Criar alerta

Citar

Pesquisa avançada

Guardado em A minha biblioteca

Quality-diversity actor-critic: learning high-performing and diverse behaviors via value...

Synergizing quality-diversity with descriptor-conditioned reinforcement learning

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

The impact of intrinsic rewards on exploration in Reinforcement Learning