The fast committor machine: Interpretable prediction with kernels
D Aristoff, M Johnson, G Simpson… - The Journal of chemical …, 2024 - pubs.aip.org
In the study of stochastic systems, the committor function describes the probability that a
system starting from an initial configuration x will reach a set B before a set A. This paper …
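For reference, the committor described in this snippet has a standard definition: for a Markov process (X_t)_{t \ge 0} with hitting times \tau_S = \inf\{t \ge 0 : X_t \in S\}, the committor is

  q(x) = \mathbb{P}\big(\tau_B < \tau_A \mid X_0 = x\big),

so that q \equiv 0 on A and q \equiv 1 on B.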
Wide neural networks trained with weight decay provably exhibit neural collapse
Deep neural networks (DNNs) at convergence consistently represent the training data in the
last layer via a highly symmetric geometric structure referred to as neural collapse. This …
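The "highly symmetric geometric structure" has a standard formalization (due to Papyan, Han, and Donoho): with last-layer class means \mu_c, global mean \mu_G, and K classes, within-class variability collapses (\Sigma_W \to 0) and the centered class means converge to a simplex equiangular tight frame,

  \frac{\langle \mu_c - \mu_G,\, \mu_{c'} - \mu_G \rangle}{\|\mu_c - \mu_G\|\, \|\mu_{c'} - \mu_G\|} \;\to\; -\frac{1}{K-1} \qquad (c \neq c'),

with all centered means of equal norm.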
Emergence in non-neural models: grokking modular arithmetic via average gradient outer product
Neural networks trained to solve modular arithmetic tasks exhibit grokking, a phenomenon
where the test accuracy starts improving long after the model achieves 100% training …
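The average gradient outer product (AGOP) named in the title is, in its usual form for a scalar predictor f trained on inputs x_1, \dots, x_n,

  \mathrm{AGOP}(f) = \frac{1}{n} \sum_{i=1}^{n} \nabla_x f(x_i)\, \nabla_x f(x_i)^{\top},

a positive semidefinite matrix whose top eigendirections indicate the input directions to which the model is most sensitive.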
Neural collapse vs. low-rank bias: Is deep neural collapse really optimal?
Deep neural networks (DNNs) exhibit a surprising structure in their final layer known as
neural collapse (NC), and a growing body of work has investigated the …
Theoretical characterisation of the Gauss-Newton conditioning in Neural Networks
The Gauss-Newton (GN) matrix plays an important role in machine learning, most evident in
its use as a preconditioning matrix for a wide family of popular adaptive methods to speed …
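For a loss L(\theta) = \frac{1}{n} \sum_i \ell(f_\theta(x_i), y_i), the Gauss-Newton matrix referred to here is typically

  G(\theta) = \frac{1}{n} \sum_{i=1}^{n} J_i^{\top} H_i J_i, \qquad J_i = \partial_\theta f_\theta(x_i), \quad H_i = \nabla_f^2\, \ell(f_\theta(x_i), y_i),

which reduces to \frac{1}{n} \sum_i J_i^{\top} J_i for the squared loss; it approximates the Hessian of L while remaining positive semidefinite whenever \ell is convex in f.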
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation
While overparameterization in machine learning models offers great benefits in terms of
optimization and generalization, it also leads to increased computational requirements as …
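The "low-rank learning & adaptation" in this title plausibly refers to LoRA-style parameterizations, in which a frozen weight matrix is corrected by a trainable low-rank product. A minimal PyTorch sketch (the class name, initializations, and dimensions below are illustrative assumptions, not taken from the paper):

import torch
import torch.nn as nn

class LowRankAdapter(nn.Module):
    """Frozen base weight W0 plus a trainable rank-r correction B @ A."""
    def __init__(self, d_in: int, d_out: int, rank: int):
        super().__init__()
        # Frozen "pretrained" weight (random here purely for illustration).
        self.W0 = nn.Parameter(torch.randn(d_out, d_in), requires_grad=False)
        self.A = nn.Parameter(0.01 * torch.randn(rank, d_in))  # trainable down-projection
        self.B = nn.Parameter(torch.zeros(d_out, rank))        # trainable up-projection; zero init keeps W = W0 at start
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x @ (self.W0 + self.B @ self.A).T  # effective weight W = W0 + B A

layer = LowRankAdapter(d_in=64, d_out=32, rank=4)
y = layer(torch.randn(8, 64))  # gradients flow only into A and B

Training only A and B cuts the trainable parameter count from d_out * d_in to rank * (d_in + d_out) (here, 2048 to 384), which is the kind of computational saving the snippet alludes to.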
The Persistence of Neural Collapse Despite Low-Rank Bias: An Analytic Perspective Through Unconstrained Features
C Garrod, JP Keating - arXiv preprint arXiv:2410.23169, 2024 - arxiv.org
Modern deep neural networks have been observed to exhibit a simple structure in their final
layer features and weights, commonly referred to as neural collapse. This phenomenon has …
Neural Collapse Beyond the Unconstrained Features Model: Landscape, Dynamics, and Generalization in the Mean-Field Regime
D Wu, M Mondelli - arXiv preprint arXiv:2501.19104, 2025 - arxiv.org
Neural Collapse is a phenomenon where the last-layer representations of a well-trained
neural network converge to a highly structured geometry. In this paper, we focus on its first …
Neural Collapse Inspired Feature Alignment for Out-of-Distribution Generalization
The spurious correlation between the background features of the image and its label arises
because samples labeled with the same class in the training set often co-occur with a …