Variance-reduced methods for machine learning

RM Gower, M Schmidt, F Bach… - Proceedings of the …, 2020 - ieeexplore.ieee.org
Stochastic optimization lies at the heart of machine learning, and its cornerstone is
stochastic gradient descent (SGD), a method introduced over 60 years ago. The last eight …

Closing the gap: Tighter analysis of alternating stochastic gradient methods for bilevel problems

T Chen, Y Sun, W Yin - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Stochastic nested optimization, including stochastic compositional, min-max, and bilevel
optimization, is gaining popularity in many machine learning applications. While the three …

Decentralized gossip-based stochastic bilevel optimization over communication networks

S Yang, X Zhang, M Wang - Advances in neural information …, 2022 - proceedings.neurips.cc
Bilevel optimization have gained growing interests, with numerous applications found in
meta learning, minimax games, reinforcement learning, and nested composition …

Multi-agent reinforcement learning via double averaging primal-dual optimization

HT Wai, Z Yang, Z Wang… - Advances in neural …, 2018 - proceedings.neurips.cc
Despite the success of single-agent reinforcement learning, multi-agent reinforcement
learning (MARL) remains challenging due to complex interactions between agents …

Stochastic optimization of areas under precision-recall curves with provable convergence

Q Qi, Y Luo, Z Xu, S Ji, T Yang - Advances in neural …, 2021 - proceedings.neurips.cc
Areas under ROC (AUROC) and precision-recall curves (AUPRC) are common metrics for
evaluating classification performance for imbalanced problems. Compared with AUROC …

Stochastic variance reduction methods for policy evaluation

SS Du, J Chen, L Li, L **ao… - … conference on machine …, 2017 - proceedings.mlr.press
Policy evaluation is concerned with estimating the value function that predicts long-term
values of states under a given policy. It is a crucial step in many reinforcement-learning …

A single timescale stochastic approximation method for nested stochastic optimization

S Ghadimi, A Ruszczynski, M Wang - SIAM Journal on Optimization, 2020 - SIAM
We study constrained nested stochastic optimization problems in which the objective
function is a composition of two smooth functions whose exact values and derivatives are …

A single-timescale method for stochastic bilevel optimization

T Chen, Y Sun, Q **ao, W Yin - International Conference on …, 2022 - proceedings.mlr.press
Stochastic bilevel optimization generalizes the classic stochastic optimization from the
minimization of a single objective to the minimization of an objective function that depends …

On the convergence and sample efficiency of variance-reduced policy gradient method

J Zhang, C Ni, C Szepesvari… - Advances in Neural …, 2021 - proceedings.neurips.cc
Policy gradient (PG) gives rise to a rich class of reinforcement learning (RL) methods.
Recently, there has been an emerging trend to augment the existing PG methods such as …

A single-timescale method for stochastic bilevel optimization

T Chen, Y Sun, Q **ao, W Yin - arxiv preprint arxiv:2102.04671, 2021 - arxiv.org
Stochastic bilevel optimization generalizes the classic stochastic optimization from the
minimization of a single objective to the minimization of an objective function that depends …