Академия Google

SL Brunton, J Nathan Kutz, K Manohar, AY Aravkin… - AIAA Journal, 2021 - arc.aiaa.org

Data science, and machine learning in particular, is rapidly transforming the scientific and
industrial landscapes. The aerospace industry is poised to capitalize on big data and …

Сохранить Цитировать Цитируется: 244 Похожие статьи Все версии статьи (6)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Directional convergence and alignment in deep learning

Z Ji, M Telgarsky - Advances in Neural Information …, 2020 - proceedings.neurips.cc

In this paper, we show that although the minimizers of cross-entropy and related
classification losses are off at infinity, network weights learned by gradient flow converge in …

Сохранить Цитировать Цитируется: 195 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Learning single-index models with shallow neural networks

A Bietti, J Bruna, C Sanford… - Advances in Neural …, 2022 - proceedings.neurips.cc

Single-index models are a class of functions given by an unknown univariate``link''function
applied to an unknown one-dimensional projection of the input. These models are …

Сохранить Цитировать Цитируется: 90 Похожие статьи Все версии статьи (12) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Gradient descent provably optimizes over-parameterized neural networks

SS Du, X Zhai, B Poczos, A Singh - arxiv preprint arxiv:1810.02054, 2018 - arxiv.org

One of the mysteries in the success of neural networks is randomly initialized first order
methods like gradient descent can achieve zero training loss even though the objective …

Сохранить Цитировать Цитируется: 855 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Unbalanced minibatch optimal transport; applications to domain adaptation

K Fatras, T Séjourné, R Flamary… - … on Machine Learning, 2021 - proceedings.mlr.press

Optimal transport distances have found many applications in machine learning for their
capacity to compare non-parametric probability distributions. Yet their algorithmic complexity …

Сохранить Цитировать Цитируется: 168 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Gradient descent maximizes the margin of homogeneous neural networks

K Lyu, J Li - arxiv preprint arxiv:1906.05890, 2019 - arxiv.org

In this paper, we study the implicit regularization of the gradient descent algorithm in
homogeneous neural networks, including fully-connected and convolutional neural …

Сохранить Цитировать Цитируется: 363 Похожие статьи Все версии статьи (3) В виде HTML

[Free GPT-4]
[DeepSeek]

[HTML] informs.org

Global optimality guarantees for policy gradient methods

J Bhandari, D Russo - Operations Research, 2024 - pubsonline.informs.org

Policy gradients methods apply to complex, poorly understood, control problems by
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …

Сохранить Цитировать Цитируется: 286 Похожие статьи Все версии статьи (7)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Gradient descent on two-layer nets: Margin maximization and simplicity bias

K Lyu, Z Li, R Wang, S Arora - Advances in Neural …, 2021 - proceedings.neurips.cc

The generalization mystery of overparametrized deep nets has motivated efforts to
understand how gradient descent (GD) converges to low-loss solutions that generalize well …

Сохранить Цитировать Цитируется: 87 Похожие статьи Все версии статьи (7) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Gradient-free methods for deterministic and stochastic nonsmooth nonconvex optimization

T Lin, Z Zheng, M Jordan - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Nonsmooth nonconvex optimization problems broadly emerge in machine learning and
business decision making, whereas two core challenges impede the development of …

Сохранить Цитировать Цитируется: 48 Похожие статьи Все версии статьи (6) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Algorithmic regularization in learning deep homogeneous models: Layers are automatically balanced

SS Du, W Hu, JD Lee - Advances in neural information …, 2018 - proceedings.neurips.cc

We study the implicit regularization imposed by gradient descent for learning multi-layer
homogeneous functions including feed-forward fully connected and convolutional deep …

Сохранить Цитировать Цитируется: 248 Похожие статьи Все версии статьи (7) В виде HTML

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Stochastic subgradient method converges on tame functions

Data-driven aerospace engineering: reframing the industry with machine learning

Directional convergence and alignment in deep learning

Learning single-index models with shallow neural networks

Gradient descent provably optimizes over-parameterized neural networks

Unbalanced minibatch optimal transport; applications to domain adaptation

Gradient descent maximizes the margin of homogeneous neural networks

Global optimality guarantees for policy gradient methods

Gradient descent on two-layer nets: Margin maximization and simplicity bias

Gradient-free methods for deterministic and stochastic nonsmooth nonconvex optimization

Algorithmic regularization in learning deep homogeneous models: Layers are automatically balanced