Controllability-aware unsupervised skill discovery

S Park, K Lee, Y Lee, P Abbeel - arxiv preprint arxiv:2302.05103, 2023 - arxiv.org
One of the key capabilities of intelligent agents is the ability to discover useful skills without
external supervision. However, the current unsupervised skill discovery methods are often …

Stabilizing unsupervised environment design with a learned adversary

I Mediratta, M Jiang, J Parker-Holder… - Conference on …, 2023 - proceedings.mlr.press
A key challenge in training generally-capable agents is the design of training tasks that
facilitate broad generalization and robustness to environment variations. This challenge …

Curricullm: Automatic task curricula design for learning complex robot skills using large language models

K Ryu, Q Liao, Z Li, K Sreenath, N Mehr - arxiv preprint arxiv:2409.18382, 2024 - arxiv.org
Curriculum learning is a training mechanism in reinforcement learning (RL) that facilitates
the achievement of complex policies by progressively increasing the task difficulty during …

Confidence-based curriculum learning for multi-agent path finding

T Phan, J Driscoll, J Romberg, S Koenig - arxiv preprint arxiv:2401.05860, 2024 - arxiv.org
A wide range of real-world applications can be formulated as Multi-Agent Path Finding
(MAPF) problem, where the goal is to find collision-free paths for multiple agents with …

Large language model-driven curriculum design for mobile networks

O Erak, O Alhussein, S Naser… - 2024 IEEE/CIC …, 2024 - ieeexplore.ieee.org
This study introduces an innovative framework that employs large language models (LLMs)
to automate the design and generation of curricula for reinforcement learning (RL). As …

Do as you teach: A multi-teacher approach to self-play in deep reinforcement learning

C Kharyal, SK Gottipati, TK Sinha, F Abdollahi… - Neural Computing and …, 2025 - Springer
A long-running challenge in the reinforcement learning (RL) community has been to train a
goal-conditioned agent in sparse reward environment such that it also generalizes to …

Feasible adversarial robust reinforcement learning for underspecified environments

JB Lanier, S McAleer, P Baldi, R Fox - arxiv preprint arxiv:2207.09597, 2022 - arxiv.org
Robust reinforcement learning (RL) considers the problem of learning policies that perform
well in the worst case among a set of possible environment parameter values. In real-world …

Using a NEAT approach with curriculums for dynamic content generation in video games

D Hind, C Harvey - Personal and Ubiquitous Computing, 2024 - Springer
This paper presents a novel exploration of the use of an evolving neural network approach
to generate dynamic content for video games, specifically for a tower defence game. The …

Diversity induced environment design via self-play

D Li, W Li, P Varakantham - arxiv preprint arxiv:2302.02119, 2023 - arxiv.org
Recent work on designing an appropriate distribution of environments has shown promise
for training effective generally capable agents. Its success is partly because of a form of …

Transferable curricula through difficulty conditioned generators

S Tio, P Varakantham - arxiv preprint arxiv:2306.13028, 2023 - arxiv.org
Advancements in reinforcement learning (RL) have demonstrated superhuman performance
in complex tasks such as Starcraft, Go, Chess etc. However, knowledge transfer from …