Continual learning: Applications and the road forward

E Verwimp, R Aljundi, S Ben-David, M Bethge… - arxiv preprint arxiv …, 2023 - arxiv.org
Continual learning is a subfield of machine learning, which aims to allow machine learning
models to continuously learn on new data, by accumulating knowledge without forgetting …

Towards certifiable ai in aviation: landscape, challenges, and opportunities

H Bello, D Geißler, L Ray, S Müller-Divéky… - arxiv preprint arxiv …, 2024 - arxiv.org
Artificial Intelligence (AI) methods are powerful tools for various domains, including critical
fields such as avionics, where certification is required to achieve and maintain an …

Hamilton-jacobi reachability in reinforcement learning: A survey

M Ganai, S Gao, S Herbert - IEEE Open Journal of Control …, 2024 - ieeexplore.ieee.org
Recent literature has proposed approaches that learn control policies with high performance
while maintaining safety guarantees. Synthesizing Hamilton-Jacobi (HJ) reachable sets has …

Prediction and control in continual reinforcement learning

N Anand, D Precup - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Temporal difference (TD) learning is often used to update the estimate of the value function
which is used by RL agents to extract useful policies. In this paper, we focus on value …

Fast trac: A parameter-free optimizer for lifelong reinforcement learning

A Muppidi, Z Zhang, H Yang - Advances in Neural …, 2025 - proceedings.neurips.cc
A key challenge in lifelong reinforcement learning (RL) is the loss of plasticity, where
previous learning progress hinders an agent's adaptation to new tasks. While regularization …

A survey of temporal credit assignment in deep reinforcement learning

E Pignatelli, J Ferret, M Geist, T Mesnard… - arxiv preprint arxiv …, 2023 - arxiv.org
The Credit Assignment Problem (CAP) refers to the longstanding challenge of
Reinforcement Learning (RL) agents to associate actions with their long-term …

A definition of open-ended learning problems for goal-conditioned agents

O Sigaud, G Baldassarre, C Colas, S Doncieux… - arxiv preprint arxiv …, 2023 - arxiv.org
A lot of recent machine learning research papers have``open-ended learning''in their title.
But very few of them attempt to define what they mean when using the term. Even worse …

Three dogmas of reinforcement learning

D Abel, MK Ho, A Harutyunyan - arxiv preprint arxiv:2407.10583, 2024 - arxiv.org
Modern reinforcement learning has been conditioned by at least three dogmas. The first is
the environment spotlight, which refers to our tendency to focus on modeling environments …

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org
Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Evolving Alignment via Asymmetric Self-Play

Z Ye, R Agarwal, T Liu, R Joshi, S Velury, QV Le… - arxiv preprint arxiv …, 2024 - arxiv.org
Current RLHF frameworks for aligning large language models (LLMs) typically assume a
fixed prompt distribution, which is sub-optimal and limits the scalability of alignment and …