Multi-agent reinforcement learning: A selective overview of theories and algorithms

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

An overview of multi-agent reinforcement learning from game theoretical perspective

Y Yang, J Wang - arXiv preprint arXiv:2011.00583, 2020 - arxiv.org
Following the remarkable success of the AlphaGo series, 2019 was a booming year that
witnessed significant advances in multi-agent reinforcement learning (MARL) techniques …

Understanding and mitigating gradient flow pathologies in physics-informed neural networks

S Wang, Y Teng, P Perdikaris - SIAM Journal on Scientific Computing, 2021 - SIAM
The widespread use of neural networks across different scientific domains often involves
constraining them to satisfy certain symmetries, conservation laws, or other domain …

On gradient descent ascent for nonconvex-concave minimax problems

T Lin, C Jin, M Jordan - International Conference on …, 2020 - proceedings.mlr.press
We consider nonconvex-concave minimax problems, $\min_{\mathbf{x}}\max_{\mathbf{y}\in\mathcal{Y}} f(\mathbf{x},\mathbf{y})$, where $f$ is nonconvex in $\mathbf{x}$ but …
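
For concreteness, the basic (two-time-scale) gradient descent ascent iteration studied for this problem class can be sketched on a toy nonconvex-concave objective; the objective, step sizes, and projection below are illustrative choices, not taken from the paper.

import numpy as np

# Toy objective f(x, y) = y*cos(x) - 0.5*y**2: nonconvex in x, concave in y,
# with y constrained to Y = [-1, 1] (all of these choices are hypothetical).
def grad_f(x, y):
    gx = -y * np.sin(x)        # df/dx
    gy = np.cos(x) - y         # df/dy
    return gx, gy

x, y = 2.0, 0.0
eta_x, eta_y = 0.01, 0.1       # smaller step on x than on y (two-time-scale GDA)
for _ in range(5000):
    gx, gy = grad_f(x, y)
    x = x - eta_x * gx                       # gradient descent on the min variable
    y = np.clip(y + eta_y * gy, -1.0, 1.0)   # projected gradient ascent on the max variable

print(x, y)   # y tracks cos(x); x approaches a stationary point of max_y f(x, y)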

Optimizing millions of hyperparameters by implicit differentiation

J Lorraine, P Vicol, D Duvenaud - … conference on artificial …, 2020 - proceedings.mlr.press
We propose an algorithm for inexpensive gradient-based hyperparameter optimization that
combines the implicit function theorem (IFT) with efficient inverse Hessian approximations …
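
For reference, the implicit-function-theorem hypergradient that this kind of approach builds on can be written out; the Neumann-series form of the inverse-Hessian approximation below is one standard choice and is given as a sketch rather than the paper's exact algorithm:

$$\frac{d\mathcal{L}_{\mathrm{val}}}{d\lambda}
= \frac{\partial\mathcal{L}_{\mathrm{val}}}{\partial\lambda}
- \frac{\partial\mathcal{L}_{\mathrm{val}}}{\partial w}
\left[\frac{\partial^2\mathcal{L}_{\mathrm{train}}}{\partial w\,\partial w^\top}\right]^{-1}
\frac{\partial^2\mathcal{L}_{\mathrm{train}}}{\partial w\,\partial\lambda^\top},
\qquad
\left[\frac{\partial^2\mathcal{L}_{\mathrm{train}}}{\partial w\,\partial w^\top}\right]^{-1}
\approx \alpha\sum_{j=0}^{K}\left(I-\alpha\,\frac{\partial^2\mathcal{L}_{\mathrm{train}}}{\partial w\,\partial w^\top}\right)^{j},$$

where $w$ are the weights at a (local) optimum of $\mathcal{L}_{\mathrm{train}}(w,\lambda)$, $\lambda$ are the hyperparameters, and $\alpha$, $K$ control the truncated Neumann approximation.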

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

Understanding and mitigating gradient pathologies in physics-informed neural networks

S Wang, Y Teng, P Perdikaris - arXiv preprint arXiv:2001.04536, 2020 - arxiv.org
The widespread use of neural networks across different scientific domains often involves
constraining them to satisfy certain symmetries, conservation laws, or other domain …

Solving a class of non-convex min-max games using iterative first order methods

M Nouiehed, M Sanjabi, T Huang… - Advances in …, 2019 - proceedings.neurips.cc
Recent applications that arise in machine learning have spurred significant interest in solving
min-max saddle point games. This problem has been extensively studied in the convex …

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

P Mertikopoulos, B Lecouat, H Zenati, CS Foo… - arXiv preprint arXiv …, 2018 - arxiv.org
Owing to their connection with generative adversarial networks (GANs), saddle-point
problems have recently attracted considerable interest in machine learning and beyond. By …
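
As a rough illustration (the Euclidean special case, not the paper's mirror-descent setting), the extra-gradient step can be sketched on the bilinear saddle point f(x, y) = x*y, where plain simultaneous gradient descent ascent spirals away from the solution while the look-ahead step converges:

# Euclidean extra-gradient on f(x, y) = x*y (illustrative toy problem).
x, y, eta = 1.0, 1.0, 0.1
for _ in range(2000):
    # extrapolation ("extra-gradient") step using the current gradients
    x_lead = x - eta * y          # df/dx = y
    y_lead = y + eta * x          # df/dy = x
    # update step using the gradients evaluated at the look-ahead point
    x, y = x - eta * y_lead, y + eta * x_lead

print(x, y)   # both converge toward the saddle point (0, 0)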

Implicit gradient regularization

DGT Barrett, B Dherin - arXiv preprint arXiv:2009.11162, 2020 - arxiv.org
Gradient descent can be surprisingly good at optimizing deep neural networks without
overfitting and without explicit regularization. We find that the discrete steps of gradient …
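
For context, the backward-error-analysis calculation underlying this kind of result: gradient descent with learning rate $h$ on a loss $E(\theta)$ tracks, to leading order, the gradient flow of a modified loss (sketched here from the standard calculation, not quoted from the paper):

$$\theta_{k+1} = \theta_k - h\,\nabla E(\theta_k)
\quad\Longrightarrow\quad
\dot{\theta} \approx -\nabla \widetilde{E}(\theta),
\qquad
\widetilde{E}(\theta) = E(\theta) + \frac{h}{4}\,\bigl\|\nabla E(\theta)\bigr\|^{2},$$

so the discrete steps implicitly penalize the squared gradient norm, with the strength of this implicit regularization growing with the learning rate.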