When do flat minima optimizers work?
Recently, flat-minima optimizers, which seek to find parameters in low-loss neighborhoods,
have been shown to improve a neural network's generalization performance over stochastic …
Temperature balancing, layer-wise weight analysis, and neural network training
Regularization in modern machine learning is crucial, and it can take various forms in
algorithmic design: training set, model family, error function, regularization terms, and …
Test accuracy vs. generalization gap: Model selection in NLP without accessing training or testing data
Selecting suitable architecture parameters and training hyperparameters is essential for
enhancing machine learning (ML) model performance. Several recent empirical studies …
When are ensembles really effective?
Ensembling has a long history in statistical data analysis, with many impactful applications.
However, in many modern machine learning settings, the benefits of ensembling are less …
Understanding robust learning through the lens of representation similarities
Abstract: Representation learning, i.e., the generation of representations useful for
downstream applications, is a task of fundamental importance that underlies much of the …
Minimum norm interpolation by perceptra: Explicit regularization and implicit bias
J Park, I Pelakh, S Wojtowytsch - Advances in Neural …, 2023 - proceedings.neurips.cc
We investigate how shallow ReLU networks interpolate between known regions. Our
analysis shows that empirical risk minimizers converge to a minimum norm interpolant as …
Evaluating natural language processing models with generalization metrics that do not need access to any training or testing data
Y Yang, R Theisen, L Hodgkinson, JE Gonzalez… - arxiv.org
… Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond
Deep learning sometimes appears to work in unexpected ways. In pursuit of a deeper
understanding of its surprising behaviors, we investigate the utility of a simple yet accurate …