A collective AI via lifelong learning and sharing at the edge

A Soltoggio, E Ben-Iwhiwhu, V Braverman… - Nature Machine …, 2024 - nature.com
One vision of a future artificial intelligence (AI) is where many separate units can learn
independently over a lifetime and share their knowledge with each other. The synergy …

Design principles for lifelong learning AI accelerators

D Kudithipudi, A Daram, AM Zyarah, FT Zohora… - Nature …, 2023 - nature.com
Lifelong learning—an agent's ability to learn throughout its lifetime—is a hallmark of
biological learning systems and a central challenge for artificial intelligence (AI). The …

Simple and scalable strategies to continually pre-train large language models

A Ibrahim, B Thérien, K Gupta, ML Richter… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) are routinely pre-trained on billions of tokens, only to start
the process over again once new data becomes available. A much more efficient solution is …

Continual pre-training of large language models: How to (re) warm your model?

K Gupta, B Thérien, A Ibrahim, ML Richter… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) are routinely pre-trained on billions of tokens, only to restart
the process over again once new data becomes available. A much cheaper and more …

Evolving parameterized prompt memory for continual learning

MR Kurniawan, X Song, Z Ma, Y He, Y Gong… - Proceedings of the …, 2024 - ojs.aaai.org
Recent studies have demonstrated the potency of leveraging prompts in Transformers for
continual learning (CL). Nevertheless, employing a discrete key-prompt bottleneck can lead …

Continual learning and catastrophic forgetting

GM van de Ven, N Soures, D Kudithipudi - arxiv preprint arxiv:2403.05175, 2024 - arxiv.org
This book chapter delves into the dynamics of continual learning, which is the process of
incrementally learning from a non-stationary stream of data. Although continual learning is a …

A comprehensive empirical evaluation on online continual learning

A Soutif-Cormerais, A Carta, A Cossu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Online continual learning aims to get closer to a live learning experience by learning directly
on a stream of data with temporally shifting distribution and by storing a minimum amount of …

Online continual learning without the storage constraint

A Prabhu, Z Cai, P Dokania, P Torr, V Koltun… - arxiv preprint arxiv …, 2023 - arxiv.org
Traditional online continual learning (OCL) research has primarily focused on mitigating
catastrophic forgetting with fixed and limited storage allocation throughout an agent's …

Addressing loss of plasticity and catastrophic forgetting in continual learning

M Elsayed, AR Mahmood - arxiv preprint arxiv:2404.00781, 2024 - arxiv.org
Deep representation learning methods struggle with continual learning, suffering from both
catastrophic forgetting of useful units and loss of plasticity, often due to rigid and unuseful …

Clap4clip: Continual learning with probabilistic finetuning for vision-language models

S Jha, D Gong, L Yao - arxiv preprint arxiv:2403.19137, 2024 - arxiv.org
Continual learning (CL) aims to help deep neural networks learn new knowledge while
retaining what has been learned. Owing to their powerful generalizability, pre-trained vision …