- Academic Search

E Verwimp, R Aljundi, S Ben-David, M Bethge… - arxiv preprint arxiv …, 2023 - arxiv.org

Continual learning is a subfield of machine learning, which aims to allow machine learning
models to continuously learn on new data, by accumulating knowledge without forgetting …

Tallenna Viittaa Viittausten määrä 31 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Towards certifiable ai in aviation: landscape, challenges, and opportunities

H Bello, D Geißler, L Ray, S Müller-Divéky… - arxiv preprint arxiv …, 2024 - arxiv.org

Artificial Intelligence (AI) methods are powerful tools for various domains, including critical
fields such as avionics, where certification is required to achieve and maintain an …

Tallenna Viittaa Viittausten määrä 4 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Hamilton-jacobi reachability in reinforcement learning: A survey

M Ganai, S Gao, S Herbert - IEEE Open Journal of Control …, 2024 - ieeexplore.ieee.org

Recent literature has proposed approaches that learn control policies with high performance
while maintaining safety guarantees. Synthesizing Hamilton-Jacobi (HJ) reachable sets has …

Tallenna Viittaa Viittausten määrä 3 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Prediction and control in continual reinforcement learning

N Anand, D Precup - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Temporal difference (TD) learning is often used to update the estimate of the value function
which is used by RL agents to extract useful policies. In this paper, we focus on value …

Tallenna Viittaa Viittausten määrä 13 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Fast trac: A parameter-free optimizer for lifelong reinforcement learning

A Muppidi, Z Zhang, H Yang - Advances in Neural …, 2025 - proceedings.neurips.cc

A key challenge in lifelong reinforcement learning (RL) is the loss of plasticity, where
previous learning progress hinders an agent's adaptation to new tasks. While regularization …

Tallenna Viittaa Viittausten määrä 2 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of temporal credit assignment in deep reinforcement learning

E Pignatelli, J Ferret, M Geist, T Mesnard… - arxiv preprint arxiv …, 2023 - arxiv.org

The Credit Assignment Problem (CAP) refers to the longstanding challenge of
Reinforcement Learning (RL) agents to associate actions with their long-term …

Tallenna Viittaa Viittausten määrä 12 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A definition of open-ended learning problems for goal-conditioned agents

O Sigaud, G Baldassarre, C Colas, S Doncieux… - arxiv preprint arxiv …, 2023 - arxiv.org

A lot of recent machine learning research papers have``open-ended learning''in their title.
But very few of them attempt to define what they mean when using the term. Even worse …

Tallenna Viittaa Viittausten määrä 11 Aiheeseen liittyviä artikkeleita Kaikki 3 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Three dogmas of reinforcement learning

D Abel, MK Ho, A Harutyunyan - arxiv preprint arxiv:2407.10583, 2024 - arxiv.org

Modern reinforcement learning has been conditioned by at least three dogmas. The first is
the environment spotlight, which refers to our tendency to focus on modeling environments …

Tallenna Viittaa Viittausten määrä 3 Aiheeseen liittyviä artikkeleita Kaikki 6 versiota HTML-versio

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey of progress on cooperative multi-agent reinforcement learning in open environment

L Yuan, Z Zhang, L Li, C Guan, Y Yu - arxiv preprint arxiv:2312.01058, 2023 - arxiv.org

Multi-agent Reinforcement Learning (MARL) has gained wide attention in recent years and
has made progress in various fields. Specifically, cooperative MARL focuses on training a …

Tallenna Viittaa Viittausten määrä 18 Aiheeseen liittyviä artikkeleita Kaikki 2 versiota Välimuistissa

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Evolving Alignment via Asymmetric Self-Play

Z Ye, R Agarwal, T Liu, R Joshi, S Velury, QV Le… - arxiv preprint arxiv …, 2024 - arxiv.org

Current RLHF frameworks for aligning large language models (LLMs) typically assume a
fixed prompt distribution, which is sub-optimal and limits the scalability of alignment and …

Tallenna Viittaa Viittausten määrä 2 Aiheeseen liittyviä artikkeleita Kaikki 5 versiota HTML-versio

Luo ilmoitus

Viittaa

Tarkennettu haku

Tallennettu omaan kirjastoon

A definition of continual reinforcement learning

Continual learning: Applications and the road forward

Towards certifiable ai in aviation: landscape, challenges, and opportunities

Hamilton-jacobi reachability in reinforcement learning: A survey

Prediction and control in continual reinforcement learning

Fast trac: A parameter-free optimizer for lifelong reinforcement learning

A survey of temporal credit assignment in deep reinforcement learning

A definition of open-ended learning problems for goal-conditioned agents

Three dogmas of reinforcement learning

A survey of progress on cooperative multi-agent reinforcement learning in open environment

Evolving Alignment via Asymmetric Self-Play