A farewell to the bias-variance tradeoff? An overview of the theory of overparameterized machine learning

Y Dar, V Muthukumar, RG Baraniuk - arXiv preprint arXiv:2109.02355, 2021 - arxiv.org
The rapid recent progress in machine learning (ML) has raised a number of scientific
questions that challenge the longstanding dogma of the field. One of the most important …

The generalization error of random features regression: Precise asymptotics and the double descent curve

S Mei, A Montanari - Communications on Pure and Applied …, 2022 - Wiley Online Library
Deep learning methods operate in regimes that defy the traditional statistical mindset.
Neural network architectures often contain more parameters than training samples, and are …
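
As a hands-on illustration of this setting (not the paper's precise asymptotics), the sketch below fits a random features regression model with a fixed random ReLU first layer and a ridgeless least squares fit of the output weights, sweeping the number of features N across the interpolation threshold N ≈ n, where the test error typically peaks before descending again. The dimensions, noise level, and ReLU activation are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative data: n samples of d-dimensional Gaussian inputs with a noisy linear target.
n, d, n_test = 100, 20, 1000
X, X_test = rng.standard_normal((n, d)), rng.standard_normal((n_test, d))
w_true = rng.standard_normal(d)
y = X @ w_true + 0.5 * rng.standard_normal(n)
y_test = X_test @ w_true + 0.5 * rng.standard_normal(n_test)

# Sweep the number of random features N across the interpolation threshold N ~ n.
for N in [20, 50, 90, 100, 110, 200, 500, 2000]:
    W = rng.standard_normal((d, N)) / np.sqrt(d)                      # fixed random first layer
    Phi, Phi_test = np.maximum(X @ W, 0), np.maximum(X_test @ W, 0)   # ReLU features
    a = np.linalg.pinv(Phi) @ y                                       # ridgeless (min-norm) fit
    print(f"N = {N:5d}  test MSE = {np.mean((Phi_test @ a - y_test) ** 2):.3f}")
```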

Surprises in high-dimensional ridgeless least squares interpolation

T Hastie, A Montanari, S Rosset, RJ Tibshirani - Annals of Statistics, 2022 - ncbi.nlm.nih.gov
Interpolators—estimators that achieve zero training error—have attracted growing attention
in machine learning, mainly because state-of-the-art neural networks appear to be models of …
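
A minimal picture of the "ridgeless" interpolator studied here, under made-up dimensions: the minimum-ℓ2-norm least squares solution in the p > n regime fits the training data exactly while still generalizing to fresh samples.

```python
import numpy as np

rng = np.random.default_rng(1)

# Overparameterized linear regression with more features (p) than samples (n); sizes made up.
n, p = 50, 200
X = rng.standard_normal((n, p))
beta_true = rng.standard_normal(p) / np.sqrt(p)
y = X @ beta_true + 0.1 * rng.standard_normal(n)

# Ridgeless (minimum-l2-norm) least squares: the limit of ridge regression as lambda -> 0+.
beta_hat = X.T @ np.linalg.solve(X @ X.T, y)   # equals pinv(X) @ y when X has full row rank

print("training MSE:", np.mean((X @ beta_hat - y) ** 2))   # ~0: the estimator interpolates
X_new = rng.standard_normal((1000, p))
y_new = X_new @ beta_true + 0.1 * rng.standard_normal(1000)
print("test MSE:    ", np.mean((X_new @ beta_hat - y_new) ** 2))
```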

Random features for kernel approximation: A survey on algorithms, theory, and beyond

F Liu, X Huang, Y Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The class of random features is one of the most popular techniques to speed up kernel
methods in large-scale problems. Related works have been recognized by the NeurIPS Test …
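
To make the speed-up concrete, here is a small sketch of one standard random features construction, random Fourier features for the Gaussian (RBF) kernel; the bandwidth, feature count, and data sizes are arbitrary illustrative choices, and the survey covers many other constructions.

```python
import numpy as np

rng = np.random.default_rng(2)

def rbf_kernel(X, Y, gamma=0.5):
    # Exact Gaussian (RBF) kernel: k(x, y) = exp(-gamma * ||x - y||^2).
    sq = np.sum(X**2, 1)[:, None] + np.sum(Y**2, 1)[None, :] - 2 * X @ Y.T
    return np.exp(-gamma * sq)

def random_fourier_features(X, D=2000, gamma=0.5):
    # Random Fourier features z(x) = sqrt(2/D) * cos(W x + b); their inner products
    # approximate the RBF kernel in expectation (W drawn from the kernel's spectral density).
    d = X.shape[1]
    W = rng.normal(scale=np.sqrt(2 * gamma), size=(d, D))
    b = rng.uniform(0, 2 * np.pi, size=D)
    return np.sqrt(2.0 / D) * np.cos(X @ W + b)

X = rng.standard_normal((200, 5))
K_exact = rbf_kernel(X, X)
Z = random_fourier_features(X)
print("max abs kernel approximation error:", np.abs(K_exact - Z @ Z.T).max())
```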

A model of double descent for high-dimensional binary linear classification

Z Deng, A Kammoun… - Information and Inference …, 2022 - academic.oup.com
We consider a model for logistic regression where only a subset of the features is used
for training a linear classifier over the training samples. The classifier is obtained by running …
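
A toy version of this setup can be simulated by training an (essentially) unregularized logistic classifier on progressively larger feature subsets of synthetic data; the subset sizes, label model, and use of scikit-learn here are illustrative assumptions rather than the paper's exact regime.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)

# Illustrative binary data: labels drawn from a logistic model over d Gaussian features.
n, d, n_test = 100, 400, 2000
X, X_test = rng.standard_normal((n, d)), rng.standard_normal((n_test, d))
w = rng.standard_normal(d) / np.sqrt(d)
y = (rng.random(n) < 1 / (1 + np.exp(-4 * X @ w))).astype(int)
y_test = (rng.random(n_test) < 1 / (1 + np.exp(-4 * X_test @ w))).astype(int)

# Train on the first p features only, sweeping p across the interpolation threshold p ~ n.
# A very large C makes the fit essentially unregularized.
for p in [25, 50, 100, 200, 400]:
    clf = LogisticRegression(C=1e6, max_iter=5000).fit(X[:, :p], y)
    err = np.mean(clf.predict(X_test[:, :p]) != y_test)
    print(f"p = {p:4d}  test error = {err:.3f}")
```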

On the Optimal Weighted Regularization in Overparameterized Linear Regression

D Wu, J Xu - Advances in Neural Information Processing …, 2020 - proceedings.neurips.cc
We consider the linear model $y = X\beta_{\star} + \epsilon$ with $X \in \mathbb{R}^{n\times p}$
in the overparameterized regime $p > n$. We estimate $\beta_{\star}$ …
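
The weighted regularization studied here generalizes ridge regression by penalizing each coordinate differently; a minimal sketch of such an estimator in the p > n regime, with arbitrary illustrative weights, is below.

```python
import numpy as np

rng = np.random.default_rng(4)

# Overparameterized linear model y = X beta_star + eps with p > n (illustrative sizes).
n, p = 80, 300
X = rng.standard_normal((n, p))
beta_star = rng.standard_normal(p) / np.sqrt(p)
y = X @ beta_star + 0.3 * rng.standard_normal(n)

def weighted_ridge(X, y, lam, w):
    # Weighted ridge: argmin ||y - X b||^2 + lam * sum_j w_j * b_j^2,
    # with closed form (X^T X + lam * diag(w))^{-1} X^T y.
    return np.linalg.solve(X.T @ X + lam * np.diag(w), X.T @ y)

w_uniform = np.ones(p)               # ordinary ridge
w_skewed = np.linspace(0.1, 10, p)   # hypothetical per-coordinate weights
for name, w in [("uniform", w_uniform), ("skewed", w_skewed)]:
    b = weighted_ridge(X, y, lam=1.0, w=w)
    print(name, "estimation error:", np.linalg.norm(b - beta_star))
```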

Understanding double descent requires a fine-grained bias-variance decomposition

B Adlam, J Pennington - Advances in neural information …, 2020 - proceedings.neurips.cc
Classical learning theory suggests that the optimal generalization performance of a machine
learning model should occur at an intermediate model complexity, with simpler models …
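
The decomposition in the paper is finer-grained than the classical one, but the basic bias-variance split can be estimated empirically by averaging a predictor over independently drawn training sets; the sketch below does this for the minimum-norm least squares interpolator under made-up dimensions.

```python
import numpy as np

rng = np.random.default_rng(5)

# Coarse empirical bias/variance split for the minimum-norm least squares predictor,
# averaged over independently drawn training sets (all sizes are illustrative).
n, p, n_test, n_repeats = 60, 120, 500, 200
beta_star = rng.standard_normal(p) / np.sqrt(p)
X_test = rng.standard_normal((n_test, p))
f_test = X_test @ beta_star              # noiseless targets at fixed test points

preds = np.empty((n_repeats, n_test))
for r in range(n_repeats):
    X = rng.standard_normal((n, p))
    y = X @ beta_star + 0.5 * rng.standard_normal(n)
    preds[r] = X_test @ (np.linalg.pinv(X) @ y)   # min-norm interpolator's predictions

bias_sq = np.mean((preds.mean(axis=0) - f_test) ** 2)
variance = np.mean(preds.var(axis=0))
print(f"bias^2 = {bias_sq:.3f}  variance = {variance:.3f}")
```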

Optimal regularization can mitigate double descent

P Nakkiran, P Venkat, S Kakade, T Ma - arXiv preprint arXiv:2003.01897, 2020 - arxiv.org
Recent empirical and theoretical studies have shown that many learning algorithms, from
linear regression to neural networks, can have test performance that is non-monotonic in …
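
A quick simulation of this effect (with illustrative sizes, and the penalty "tuned" on the test set purely for demonstration): the ridgeless estimator's test error typically spikes near the interpolation threshold p = n, while ridge regression with a well-chosen penalty stays roughly monotone.

```python
import numpy as np

rng = np.random.default_rng(6)

# Ridgeless vs. tuned ridge as the number of features p crosses the threshold p = n.
n, n_test, sigma = 80, 1000, 0.5
lams = np.logspace(-3, 2, 20)

for p in [40, 70, 80, 90, 120, 300]:
    X = rng.standard_normal((n, p))
    beta = rng.standard_normal(p) / np.sqrt(p)
    y = X @ beta + sigma * rng.standard_normal(n)
    X_te = rng.standard_normal((n_test, p))
    y_te = X_te @ beta + sigma * rng.standard_normal(n_test)

    def ridge_mse(lam):
        b = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
        return np.mean((X_te @ b - y_te) ** 2)

    ridgeless = np.mean((X_te @ (np.linalg.pinv(X) @ y) - y_te) ** 2)
    best = min(ridge_mse(lam) for lam in lams)   # "oracle" tuning, for illustration only
    print(f"p = {p:4d}  ridgeless MSE = {ridgeless:7.2f}  tuned-ridge MSE = {best:5.2f}")
```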

Finite-sample analysis of interpolating linear classifiers in the overparameterized regime

NS Chatterji, PM Long - Journal of Machine Learning Research, 2021 - jmlr.org
We prove bounds on the population risk of the maximum margin algorithm for two-class
linear classification. For linearly separable training data, the maximum margin algorithm has …
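
For a concrete instance of the maximum margin algorithm, the sketch below approximates the hard-margin solution on linearly separable, overparameterized synthetic data using a soft-margin linear SVM with a very large penalty parameter C; the data model and sizes are illustrative assumptions.

```python
import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.default_rng(7)

# Overparameterized (p > n) two-class data with a class-dependent mean shift, so the
# training set is linearly separable; all sizes and the mean shift are illustrative.
n, p, n_test = 60, 300, 2000
mu = np.zeros(p); mu[0] = 3.0
y = rng.choice([-1, 1], size=n)
X = rng.standard_normal((n, p)) + np.outer(y, mu)
y_test = rng.choice([-1, 1], size=n_test)
X_test = rng.standard_normal((n_test, p)) + np.outer(y_test, mu)

# With a very large C, the soft-margin linear SVM approximates the hard-margin
# (maximum-margin) classifier on separable data.
clf = LinearSVC(C=1e6, max_iter=200000).fit(X, y)
print("training error:", np.mean(clf.predict(X) != y))        # 0 on separable data
print("test error:    ", np.mean(clf.predict(X_test) != y_test))
```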

Overparameterization improves robustness to covariate shift in high dimensions

N Tripuraneni, B Adlam… - Advances in Neural …, 2021 - proceedings.neurips.cc
A significant obstacle in the development of robust machine learning models is covariate
shift, a form of distribution shift that occurs when the input distributions of the …
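
A toy version of covariate shift can be simulated by keeping the labeling function fixed while changing the input covariance between training and test, then sweeping the width of a random features model from under- to over-parameterized; the covariances, sizes, and ReLU features below are illustrative assumptions, not the paper's model.

```python
import numpy as np

rng = np.random.default_rng(8)

# Toy covariate shift: training and test inputs have different covariances, while the
# labeling function y = x^T beta_star + noise stays the same (all sizes illustrative).
n, n_test, d = 100, 2000, 20
beta_star = rng.standard_normal(d) / np.sqrt(d)
cov_train = np.diag(np.linspace(1.0, 2.0, d))
cov_test = np.diag(np.linspace(2.0, 0.5, d))     # shifted input distribution

X = rng.multivariate_normal(np.zeros(d), cov_train, size=n)
y = X @ beta_star + 0.3 * rng.standard_normal(n)
X_te = rng.multivariate_normal(np.zeros(d), cov_test, size=n_test)
y_te = X_te @ beta_star + 0.3 * rng.standard_normal(n_test)

# Sweep the number of random ReLU features to move from under- to over-parameterized.
for N in [10, 50, 100, 200, 1000]:
    W = rng.standard_normal((d, N)) / np.sqrt(d)
    a = np.linalg.pinv(np.maximum(X @ W, 0)) @ y              # min-norm top-layer fit
    shift_mse = np.mean((np.maximum(X_te @ W, 0) @ a - y_te) ** 2)
    print(f"N = {N:5d}  test MSE under covariate shift = {shift_mse:.3f}")
```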