Synergizing quality-diversity with descriptor-conditioned reinforcement learning

M Faldor, F Chalumeau, M Flageat, A Cully - arxiv preprint arxiv …, 2023 - arxiv.org
A hallmark of intelligence is the ability to exhibit a wide range of effective behaviors. Inspired
by this principle, Quality-Diversity algorithms, such as MAP-Elites, are evolutionary methods …

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Z Wang, B Wang, H **g, H Li, H Dou - arxiv preprint arxiv:2408.01880, 2024 - arxiv.org
Recent years, multi-hop reasoning has been widely studied for knowledge graph (KG)
reasoning due to its efficacy and interpretability. However, previous multi-hop reasoning …

The impact of intrinsic rewards on exploration in Reinforcement Learning

A Kayal, E Pignatelli, L Toni - arxiv preprint arxiv:2501.11533, 2025 - arxiv.org
One of the open challenges in Reinforcement Learning is the hard exploration problem in
sparse reward environments. Various types of intrinsic rewards have been proposed to …