- Academic Search

RM Gower, M Schmidt, F Bach… - Proceedings of the …, 2020 - ieeexplore.ieee.org

Stochastic optimization lies at the heart of machine learning, and its cornerstone is
stochastic gradient descent (SGD), a method introduced over 60 years ago. The last eight …

Lagre Referanse Sitert av 145 Beslektede artikler Alle 14 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Closing the gap: Tighter analysis of alternating stochastic gradient methods for bilevel problems

T Chen, Y Sun, W Yin - Advances in Neural Information …, 2021 - proceedings.neurips.cc

Stochastic nested optimization, including stochastic compositional, min-max, and bilevel
optimization, is gaining popularity in many machine learning applications. While the three …

Lagre Referanse Sitert av 132 Beslektede artikler Alle 7 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Decentralized gossip-based stochastic bilevel optimization over communication networks

S Yang, X Zhang, M Wang - Advances in neural information …, 2022 - proceedings.neurips.cc

Bilevel optimization have gained growing interests, with numerous applications found in
meta learning, minimax games, reinforcement learning, and nested composition …

Lagre Referanse Sitert av 55 Beslektede artikler Alle 8 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Multi-agent reinforcement learning via double averaging primal-dual optimization

HT Wai, Z Yang, Z Wang… - Advances in neural …, 2018 - proceedings.neurips.cc

Despite the success of single-agent reinforcement learning, multi-agent reinforcement
learning (MARL) remains challenging due to complex interactions between agents …

Lagre Referanse Sitert av 207 Beslektede artikler Alle 8 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Stochastic optimization of areas under precision-recall curves with provable convergence

Q Qi, Y Luo, Z Xu, S Ji, T Yang - Advances in neural …, 2021 - proceedings.neurips.cc

Areas under ROC (AUROC) and precision-recall curves (AUPRC) are common metrics for
evaluating classification performance for imbalanced problems. Compared with AUROC …

Lagre Referanse Sitert av 86 Beslektede artikler Alle 9 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Stochastic variance reduction methods for policy evaluation

SS Du, J Chen, L Li, L **ao… - … conference on machine …, 2017 - proceedings.mlr.press

Policy evaluation is concerned with estimating the value function that predicts long-term
values of states under a given policy. It is a crucial step in many reinforcement-learning …

Lagre Referanse Sitert av 215 Beslektede artikler Alle 6 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A single timescale stochastic approximation method for nested stochastic optimization

S Ghadimi, A Ruszczynski, M Wang - SIAM Journal on Optimization, 2020 - SIAM

We study constrained nested stochastic optimization problems in which the objective
function is a composition of two smooth functions whose exact values and derivatives are …

Lagre Referanse Sitert av 141 Beslektede artikler Alle 9 versjoner

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

A single-timescale method for stochastic bilevel optimization

T Chen, Y Sun, Q **ao, W Yin - International Conference on …, 2022 - proceedings.mlr.press

Stochastic bilevel optimization generalizes the classic stochastic optimization from the
minimization of a single objective to the minimization of an objective function that depends …

Lagre Referanse Sitert av 90 Beslektede artikler Alle 2 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

On the convergence and sample efficiency of variance-reduced policy gradient method

J Zhang, C Ni, C Szepesvari… - Advances in Neural …, 2021 - proceedings.neurips.cc

Policy gradient (PG) gives rise to a rich class of reinforcement learning (RL) methods.
Recently, there has been an emerging trend to augment the existing PG methods such as …

Lagre Referanse Sitert av 82 Beslektede artikler Alle 10 versjoner HTML-versjon

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A single-timescale method for stochastic bilevel optimization

T Chen, Y Sun, Q **ao, W Yin - arxiv preprint arxiv:2102.04671, 2021 - arxiv.org

Stochastic bilevel optimization generalizes the classic stochastic optimization from the
minimization of a single objective to the minimization of an objective function that depends …

Lagre Referanse Sitert av 83 Beslektede artikler Alle 3 versjoner HTML-versjon

Opprett varsel

Referanse

Avansert søk

Lagret i Mitt bibliotek

Finite-sum composition optimization via variance reduced gradient descent

Variance-reduced methods for machine learning

Closing the gap: Tighter analysis of alternating stochastic gradient methods for bilevel problems

Decentralized gossip-based stochastic bilevel optimization over communication networks

Multi-agent reinforcement learning via double averaging primal-dual optimization

Stochastic optimization of areas under precision-recall curves with provable convergence

Stochastic variance reduction methods for policy evaluation

A single timescale stochastic approximation method for nested stochastic optimization

A single-timescale method for stochastic bilevel optimization

On the convergence and sample efficiency of variance-reduced policy gradient method

A single-timescale method for stochastic bilevel optimization