Variance-reduced methods for machine learning
Stochastic optimization lies at the heart of machine learning, and its cornerstone is
stochastic gradient descent (SGD), a method introduced over 60 years ago. The last eight …
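Since SGD recurs throughout this list, a minimal sketch of a single SGD run may help fix notation; the callable stochastic_grad and the toy least-squares example are placeholders chosen for illustration, not taken from the survey.

```python
import numpy as np

def sgd(x0, stochastic_grad, lr=0.01, n_steps=1000, seed=0):
    """Plain SGD: repeatedly step against an unbiased stochastic gradient.

    stochastic_grad(x, rng) should return an unbiased estimate of the full
    gradient at x (for example, the gradient on one uniformly sampled data point).
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    for _ in range(n_steps):
        g = stochastic_grad(x, rng)   # noisy but unbiased gradient estimate
        x -= lr * g                   # move against the estimate
    return x

# Toy usage: least squares with one randomly sampled row per step.
A = np.random.default_rng(1).normal(size=(200, 5))
b = A @ np.ones(5)

def one_row_grad(x, rng):
    i = rng.integers(len(A))
    return A[i] * (A[i] @ x - b[i])

x_hat = sgd(np.zeros(5), one_row_grad, lr=0.05, n_steps=5000)
```

The variance-reduced methods the survey covers keep this outer loop but replace the raw estimate g with a corrected estimate whose variance shrinks as the iterates settle down.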
SPIDER: Near-optimal non-convex optimization via stochastic path-integrated differential estimator
In this paper, we propose a new technique named Stochastic Path-Integrated Differential EstimatoR (SPIDER), which can be used to track many deterministic quantities of …
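The snippet names the estimator but not the update, so the following is a sketch of the SPIDER-type recursion as it is usually presented; full_grad(x) and minibatch_grad(x, idx) are hypothetical gradient oracles, n_data is an assumed dataset size, and the normalized O(ε)-length steps of the original SPIDER-SFO are omitted for brevity.

```python
import numpy as np

def spider_sfo(x0, full_grad, minibatch_grad, lr=0.01, q=50, batch=32,
               n_data=10_000, n_steps=1000, seed=0):
    """Sketch of a SPIDER-type loop: a recursive gradient estimator that is
    refreshed with a full (or large-batch) gradient every q steps and is
    otherwise updated with a gradient *difference* evaluated on the same
    minibatch at the current and the previous iterate.
    """
    rng = np.random.default_rng(seed)
    x_prev = np.array(x0, dtype=float)
    v = full_grad(x_prev)                  # initial estimate: full/large-batch gradient
    x = x_prev - lr * v
    for t in range(1, n_steps):
        if t % q == 0:
            v = full_grad(x)               # periodic refresh of the estimator
        else:
            idx = rng.integers(0, n_data, size=batch)  # minibatch indices (n_data is assumed)
            # Same minibatch at both iterates, so most of the sampling noise cancels.
            v = v + minibatch_grad(x, idx) - minibatch_grad(x_prev, idx)
        x_prev, x = x, x - lr * v
    return x
```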
Lower bounds for non-convex stochastic optimization
We lower bound the complexity of finding ϵ-stationary points (with gradient norm at most ϵ)
using stochastic first-order methods. In a well-studied model where algorithms access …
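In symbols, the target whose oracle complexity is bounded here, and against which the estimators elsewhere in this list are analysed, is simply (restating the snippet):

\[
  \text{find } x \ \text{such that} \ \|\nabla F(x)\| \le \epsilon .
\]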
Momentum-based variance reduction in non-convex SGD
A Cutkosky, F Orabona - Advances in neural information …, 2019 - proceedings.neurips.cc
Variance reduction has emerged in recent years as a strong competitor to stochastic
gradient descent in non-convex problems, providing the first algorithms to improve upon the …
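For concreteness, here is a sketch of the kind of momentum-corrected estimator this paper (STORM) is associated with; draw_sample and stoch_grad are hypothetical oracles, and the adaptive stepsize and momentum schedules of the paper are replaced by constants to keep the sketch short.

```python
import numpy as np

def storm_style(x0, draw_sample, stoch_grad, lr=0.01, a=0.1, n_steps=1000, seed=0):
    """Sketch of a momentum-based variance-reduced (STORM-style) loop.

    d_t = grad(x_t; xi_t) + (1 - a) * (d_{t-1} - grad(x_{t-1}; xi_t)):
    an exponential moving average of gradients plus a correction evaluated on
    the SAME sample xi_t at the previous iterate, which removes the need for
    the large batches or checkpoints of earlier variance-reduced methods.
    """
    rng = np.random.default_rng(seed)
    x_prev = np.array(x0, dtype=float)
    xi = draw_sample(rng)
    d = stoch_grad(x_prev, xi)              # initialise with a plain stochastic gradient
    x = x_prev - lr * d
    for _ in range(n_steps - 1):
        xi = draw_sample(rng)               # one fresh sample per step
        g_new = stoch_grad(x, xi)
        g_old = stoch_grad(x_prev, xi)      # same sample, previous iterate
        d = g_new + (1.0 - a) * (d - g_old) # momentum + variance-reduction correction
        x_prev, x = x, x - lr * d
    return x
```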
Provably faster algorithms for bilevel optimization
Bilevel optimization has been widely applied in many important machine learning
applications such as hyperparameter optimization and meta-learning. Recently, several …
Optimal stochastic non-smooth non-convex optimization through online-to-non-convex conversion
A Cutkosky, H Mehta… - … Conference on Machine …, 2023 - proceedings.mlr.press
We present new algorithms for optimizing non-smooth, non-convex stochastic objectives
based on a novel analysis technique. This improves the current best-known complexity for …
AdaGrad stepsizes: Sharp convergence over nonconvex landscapes
Adaptive gradient methods such as AdaGrad and its variants update the stepsize in
stochastic gradient descent on the fly according to the gradients received along the way; …
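As a reference point, here is a sketch of the standard diagonal AdaGrad update; whether the snippet refers to the diagonal or the scalar "norm" variant is not visible from the abstract, so treat this purely as an illustration of stepsizes adapted from past gradients, with stoch_grad a placeholder oracle.

```python
import numpy as np

def adagrad(x0, stoch_grad, base_lr=0.1, eps=1e-8, n_steps=1000, seed=0):
    """Sketch of diagonal AdaGrad: each coordinate's effective stepsize shrinks
    according to the squared gradients accumulated so far, so no hand-tuned
    decay schedule is needed.
    """
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    accum = np.zeros_like(x)                        # running sum of squared gradients
    for _ in range(n_steps):
        g = stoch_grad(x, rng)
        accum += g ** 2
        x -= base_lr * g / (np.sqrt(accum) + eps)   # per-coordinate adaptive stepsize
    return x
```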
A near-optimal algorithm for stochastic bilevel optimization via double-momentum
This paper proposes a new algorithm, the Single-timescale Double-momentum Stochastic Approximation …
PAGE: A simple and optimal probabilistic gradient estimator for nonconvex optimization
In this paper, we propose a novel stochastic gradient estimator—ProbAbilistic Gradient
Estimator (PAGE)—for nonconvex optimization. PAGE is easy to implement as it is designed …
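Structurally, PAGE is close to the SPIDER loop sketched earlier, except that the full-gradient refresh is triggered by a biased coin flip rather than on a fixed schedule; the sketch below follows that description, again with hypothetical full_grad and minibatch_grad oracles and an assumed dataset size n_data.

```python
import numpy as np

def page(x0, full_grad, minibatch_grad, lr=0.01, p=0.05, batch=32,
         n_data=10_000, n_steps=1000, seed=0):
    """Sketch of a PAGE-style estimator: at every step flip a biased coin;
    with probability p recompute a full (or large-batch) gradient, otherwise
    reuse the previous estimate plus a small-batch gradient difference.
    """
    rng = np.random.default_rng(seed)
    x_prev = np.array(x0, dtype=float)
    v = full_grad(x_prev)
    x = x_prev - lr * v
    for _ in range(n_steps - 1):
        if rng.random() < p:
            v = full_grad(x)                           # occasional full refresh
        else:
            idx = rng.integers(0, n_data, size=batch)  # small minibatch, same at both iterates
            v = v + minibatch_grad(x, idx) - minibatch_grad(x_prev, idx)
        x_prev, x = x, x - lr * v
    return x
```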
[BOOK][B] First-order and stochastic optimization methods for machine learning
G Lan - 2020 - Springer
Since its beginning, optimization has played a vital role in data science. The analysis and
solution methods for many statistical and machine learning models rely on optimization. The …