Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Gradient starvation: A learning proclivity in neural networks
We identify and formalize a fundamental gradient descent phenomenon resulting in a
learning proclivity in over-parameterized neural networks. Gradient Starvation arises when …
learning proclivity in over-parameterized neural networks. Gradient Starvation arises when …
An empirical study of example forgetting during deep neural network learning
Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics
of neural networks as they train on single classification tasks. Our goal is to understand …
of neural networks as they train on single classification tasks. Our goal is to understand …
Correct-n-contrast: A contrastive approach for improving robustness to spurious correlations
Spurious correlations pose a major challenge for robust machine learning. Models trained
with empirical risk minimization (ERM) may learn to rely on correlations between class …
with empirical risk minimization (ERM) may learn to rely on correlations between class …
Neural redshift: Random networks are not random functions
Our understanding of the generalization capabilities of neural networks NNs is still
incomplete. Prevailing explanations are based on implicit biases of gradient descent GD but …
incomplete. Prevailing explanations are based on implicit biases of gradient descent GD but …
Unsupervised state representation learning in atari
State representation learning, or the ability to capture latent generative factors of an
environment is crucial for building intelligent agents that can perform a wide variety of tasks …
environment is crucial for building intelligent agents that can perform a wide variety of tasks …
On the foundations of shortcut learning
Deep-learning models can extract a rich assortment of features from data. Which features a
model uses depends not only on\emph {predictivity}--how reliably a feature indicates …
model uses depends not only on\emph {predictivity}--how reliably a feature indicates …
Understanding learning dynamics of language models with SVCCA
Research has shown that neural models implicitly encode linguistic features, but there has
been no research showing\emph {how} these encodings arise as the models are trained …
been no research showing\emph {how} these encodings arise as the models are trained …
Understanding visual feature reliance through the lens of complexity
Recent studies suggest that deep learning models' inductive bias towards favoring simpler
features may be an origin of shortcut learning. Yet, there has been limited focus on …
features may be an origin of shortcut learning. Yet, there has been limited focus on …
The implicit bias of depth: How incremental learning drives generalization
A leading hypothesis for the surprising generalization of neural networks is that the
dynamics of gradient descent bias the model towards simple solutions, by searching through …
dynamics of gradient descent bias the model towards simple solutions, by searching through …
An investigation of critical issues in bias mitigation techniques
A critical problem in deep learning is that systems learn inappropriate biases, resulting in
their inability to perform well on minority groups. This has led to the creation of multiple …
their inability to perform well on minority groups. This has led to the creation of multiple …