Acceleration methods

A d'Aspremont, D Scieur, A Taylor - Foundations and Trends® …, 2021 - nowpublishers.com
This monograph covers some recent advances in a range of acceleration techniques
frequently used in convex optimization. We first use quadratic optimization problems to …
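
As an illustration of the kind of acceleration the monograph analyzes on quadratics, here is a minimal NumPy sketch (not taken from the text) comparing plain gradient descent with Nesterov's accelerated gradient on a random strongly convex quadratic; the problem instance, the step size 1/L, and the constant momentum coefficient are standard textbook choices, not the monograph's.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 200
A = rng.standard_normal((d, d))
H = A.T @ A / d + 0.1 * np.eye(d)      # random strongly convex quadratic
b = rng.standard_normal(d)
x_star = np.linalg.solve(H, b)

eigs = np.linalg.eigvalsh(H)
L, mu = eigs.max(), eigs.min()         # smoothness and strong convexity constants

def grad(x):
    return H @ x - b

# Plain gradient descent with step 1/L.
x = np.zeros(d)
for _ in range(300):
    x = x - grad(x) / L

# Nesterov's accelerated gradient (constant momentum for strongly convex f).
beta = (np.sqrt(L) - np.sqrt(mu)) / (np.sqrt(L) + np.sqrt(mu))
y = z = np.zeros(d)
for _ in range(300):
    z_new = y - grad(y) / L
    y = z_new + beta * (z_new - z)
    z = z_new

print("gradient descent error :", np.linalg.norm(x - x_star))
print("accelerated error      :", np.linalg.norm(z - x_star))
```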

SGD in the large: Average-case analysis, asymptotics, and stepsize criticality

C Paquette, K Lee, F Pedregosa… - … on Learning Theory, 2021 - proceedings.mlr.press
We propose a new framework, inspired by random matrix theory, for analyzing the dynamics
of stochastic gradient descent (SGD) when both the number of samples and the dimension are …
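
A toy instance of the regime described above, not the paper's analysis framework: single-sample SGD on a random least-squares problem in which the sample count n and the dimension d are both large; the data model, noise level, and constant step size are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n, d = 4000, 1000                      # samples and dimension both large
A = rng.standard_normal((n, d)) / np.sqrt(d)
x_true = rng.standard_normal(d)
b = A @ x_true + 0.1 * rng.standard_normal(n)

x = np.zeros(d)
step = 0.5                             # constant step size (illustrative)
for _ in range(20 * n):                # single-sample SGD
    i = rng.integers(n)
    a_i = A[i]
    x -= step * (a_i @ x - b[i]) * a_i

# Constant-step SGD settles at a noise floor around the signal.
print("relative error:", np.linalg.norm(x - x_true) / np.linalg.norm(x_true))
```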

Halting time is predictable for large models: A universality property and average-case analysis

C Paquette, B van Merriënboer, E Paquette… - Foundations of …, 2023 - Springer
Average-case analysis computes the complexity of an algorithm averaged over all possible
inputs. Compared to worst-case analysis, it is more representative of the typical behavior of …
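
A hedged sketch of the idea, not the paper's universality result: run fixed-step gradient descent to a set tolerance on many random quadratic instances and inspect how concentrated the halting times are; the problem ensemble and tolerance are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(2)
d, trials, tol = 300, 50, 1e-6
halting_times = []

for _ in range(trials):
    A = rng.standard_normal((2 * d, d)) / np.sqrt(2 * d)
    H = A.T @ A                                    # random (Wishart) Hessian
    b = rng.standard_normal(d) / np.sqrt(d)
    L = np.linalg.eigvalsh(H).max()
    x, k = np.zeros(d), 0
    while np.linalg.norm(H @ x - b) > tol and k < 100_000:
        x -= (H @ x - b) / L                       # fixed-step gradient descent
        k += 1
    halting_times.append(k)

# Halting times concentrate sharply around their mean over random inputs.
print("mean iterations:", np.mean(halting_times))
print("relative spread:", np.std(halting_times) / np.mean(halting_times))
```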

Acceleration through spectral density estimation

F Pedregosa, D Scieur - International Conference on …, 2020 - proceedings.mlr.press
We develop a framework for the average-case analysis of random quadratic problems and
derive algorithms that are optimal under this analysis. This yields a new class of methods …
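
For intuition only, here is the basic ingredient such average-case methods start from, an empirical spectral density of the Hessian of a random quadratic; the full eigendecomposition below is for illustration, whereas practical estimators avoid it.

```python
import numpy as np

rng = np.random.default_rng(3)
n, d = 2000, 500
A = rng.standard_normal((n, d)) / np.sqrt(n)
H = A.T @ A                           # Hessian of the least-squares objective

# Empirical spectral density (here close to a Marchenko-Pastur law);
# average-case optimal methods tune step/momentum sequences to this density.
eigs = np.linalg.eigvalsh(H)
hist, edges = np.histogram(eigs, bins=20, density=True)
for lo, hi, h in zip(edges[:-1], edges[1:], hist):
    print(f"[{lo:5.2f}, {hi:5.2f})  density {h:.3f}")
```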

Universal average-case optimality of Polyak momentum

D Scieur, F Pedregosa - International Conference on …, 2020 - proceedings.mlr.press
Polyak momentum (PM), also known as the heavy-ball method, is a widely used optimization
method that enjoys an asymptotically optimal worst-case complexity on quadratic objectives …
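
For reference, a minimal sketch of the heavy-ball recursion the paper studies, applied to a random quadratic with the classical parameter choices for alpha and beta; the instance and iteration budget are illustrative, and none of this reproduces the paper's average-case argument.

```python
import numpy as np

rng = np.random.default_rng(4)
d = 200
A = rng.standard_normal((d, d))
H = A.T @ A / d + 0.05 * np.eye(d)     # random strongly convex quadratic
b = rng.standard_normal(d)

eigs = np.linalg.eigvalsh(H)
L, mu = eigs.max(), eigs.min()

# Classical Polyak (heavy-ball) parameters for quadratics.
alpha = 4.0 / (np.sqrt(L) + np.sqrt(mu)) ** 2
beta = ((np.sqrt(L) - np.sqrt(mu)) / (np.sqrt(L) + np.sqrt(mu))) ** 2

x_prev = x = np.zeros(d)
for _ in range(500):
    x_next = x - alpha * (H @ x - b) + beta * (x - x_prev)
    x_prev, x = x, x_next

print("gradient norm at the last iterate:", np.linalg.norm(H @ x - b))
```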

Debiasing distributed second order optimization with surrogate sketching and scaled regularization

M Derezinski, B Bartan, M Pilanci… - Advances in Neural …, 2020 - proceedings.neurips.cc
In distributed second order optimization, a standard strategy is to average many local
estimates, each of which is based on a small sketch or batch of the data. However, the local …
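
A small sketch of the baseline averaging strategy described above (not the paper's debiasing method): each worker solves a ridge problem on its own batch and the coordinator averages the local solutions, which over-regularizes relative to the full-data solve; the problem sizes and regularization level are assumptions.

```python
import numpy as np

rng = np.random.default_rng(5)
n, d, workers, lam = 10000, 50, 20, 100.0
A = rng.standard_normal((n, d))
x_true = rng.standard_normal(d)
b = A @ x_true + rng.standard_normal(n)

# Each worker solves a ridge problem on its own batch; the coordinator averages.
batches = np.array_split(rng.permutation(n), workers)
local = []
for idx in batches:
    Ai, bi = A[idx], b[idx]
    local.append(np.linalg.solve(Ai.T @ Ai + lam * np.eye(d), Ai.T @ bi))
x_avg = np.mean(local, axis=0)

# The averaged estimate is biased (each batch is over-regularized relative to
# the full data), which is the effect the paper's method corrects.
x_full = np.linalg.solve(A.T @ A + lam * np.eye(d), A.T @ b)
print("averaged vs full-data ridge:",
      np.linalg.norm(x_avg - x_full) / np.linalg.norm(x_full))
```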

Effective dimension adaptive sketching methods for faster regularized least-squares optimization

J Lacotte, M Pilanci - Advances in neural information …, 2020 - proceedings.neurips.cc
We propose a new randomized algorithm for solving L2-regularized least-squares problems
based on sketching. We consider two of the most popular random embeddings, namely …
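
A minimal sketch-and-solve illustration under assumed sizes, using a Gaussian embedding, for an L2-regularized least-squares problem; the fixed sketch size m here stands in for the effective-dimension-adaptive choice the paper develops.

```python
import numpy as np

rng = np.random.default_rng(6)
n, d, lam = 10000, 200, 10.0
A = rng.standard_normal((n, d))
b = rng.standard_normal(n)

m = 800                                       # sketch size; the paper adapts
                                              # this to the effective dimension
S = rng.standard_normal((m, n)) / np.sqrt(m)  # Gaussian embedding
SA, Sb = S @ A, S @ b

# Sketch-and-solve estimate of the ridge solution versus the exact one.
x_sketch = np.linalg.solve(SA.T @ SA + lam * np.eye(d), SA.T @ Sb)
x_exact = np.linalg.solve(A.T @ A + lam * np.eye(d), A.T @ b)
print("relative error:",
      np.linalg.norm(x_sketch - x_exact) / np.linalg.norm(x_exact))
```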

Conformal frequency estimation using discrete sketched data with coverage for distinct queries

M Sesia, S Favaro, E Dobriban - Journal of Machine Learning Research, 2023 - jmlr.org
This paper develops conformal inference methods to construct a confidence interval for the
frequency of a queried object in a very large discrete data set, based on a sketch with a …
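
The conformal construction itself is not reproduced here; below is only a minimal count-min sketch of the sort such frequency queries are issued against, written as a plain illustrative class with assumed width and depth.

```python
import numpy as np

class CountMinSketch:
    """Minimal count-min sketch: hashed counters giving (upward-biased)
    frequency estimates for items in a large discrete data set."""

    def __init__(self, width=2048, depth=5, seed=0):
        self.width, self.depth = width, depth
        self.table = np.zeros((depth, width), dtype=np.int64)
        self.salts = np.random.default_rng(seed).integers(1, 2**31, size=depth)

    def _cols(self, item):
        return [hash((int(s), item)) % self.width for s in self.salts]

    def add(self, item):
        for row, col in enumerate(self._cols(item)):
            self.table[row, col] += 1

    def query(self, item):
        # Collisions only ever add, so the estimate never undercounts.
        return min(self.table[row, col] for row, col in enumerate(self._cols(item)))

cms = CountMinSketch()
stream = ["a"] * 500 + ["b"] * 40 + [f"item{i}" for i in range(10000)]
for item in stream:
    cms.add(item)
print("estimated frequency of 'a':", cms.query("a"))   # at least 500
```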

Training quantized neural networks to global optimality via semidefinite programming

B Bartan, M Pilanci - International Conference on Machine …, 2021 - proceedings.mlr.press
Neural networks (NNs) have been extremely successful across many tasks in machine
learning. Quantization of NN weights has become an important topic due to its impact on …

Faster least squares optimization

J Lacotte, M Pilanci - arXiv preprint arXiv:1911.02675, 2019 - arxiv.org
We investigate iterative methods with randomized preconditioners for solving
overdetermined least-squares problems, where the preconditioners are based on a random …
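
A hedged sketch of the sketch-and-precondition idea: QR-factor a small Gaussian sketch of A to build a preconditioner, then run conjugate gradient on the preconditioned normal equations; the Gaussian embedding, sketch size, and iteration count are illustrative assumptions rather than the paper's choices.

```python
import numpy as np

rng = np.random.default_rng(7)
n, d = 10000, 100
A = rng.standard_normal((n, d)) * rng.uniform(0.1, 10.0, size=d)  # ill-conditioned columns
b = rng.standard_normal(n)

# Randomized preconditioner: QR-factor a small Gaussian sketch of A, so that
# A @ inv(R) is close to orthonormal and the preconditioned problem is easy.
m = 6 * d
S = rng.standard_normal((m, n)) / np.sqrt(m)
_, R = np.linalg.qr(S @ A)

def apply_M(y):
    """Normal-equations operator of the preconditioned matrix A @ inv(R)."""
    z = np.linalg.solve(R, y)
    return np.linalg.solve(R.T, A.T @ (A @ z))

# Conjugate gradient on the (well-conditioned) preconditioned normal equations.
c = np.linalg.solve(R.T, A.T @ b)
y = np.zeros(d)
r = c - apply_M(y)
p = r.copy()
for _ in range(25):
    Mp = apply_M(p)
    step = (r @ r) / (p @ Mp)
    y += step * p
    r_new = r - step * Mp
    p = r_new + ((r_new @ r_new) / (r @ r)) * p
    r = r_new

x = np.linalg.solve(R, y)              # undo the change of variables
x_exact, *_ = np.linalg.lstsq(A, b, rcond=None)
print("relative error:", np.linalg.norm(x - x_exact) / np.linalg.norm(x_exact))
```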