Data-driven aerospace engineering: reframing the industry with machine learning

SL Brunton, J Nathan Kutz, K Manohar, AY Aravkin… - AIAA Journal, 2021 - arc.aiaa.org
Data science, and machine learning in particular, is rapidly transforming the scientific and
industrial landscapes. The aerospace industry is poised to capitalize on big data and …

Directional convergence and alignment in deep learning

Z Ji, M Telgarsky - Advances in Neural Information …, 2020 - proceedings.neurips.cc
In this paper, we show that although the minimizers of cross-entropy and related
classification losses are off at infinity, network weights learned by gradient flow converge in …

Learning single-index models with shallow neural networks

A Bietti, J Bruna, C Sanford… - Advances in Neural …, 2022 - proceedings.neurips.cc
Single-index models are a class of functions given by an unknown univariate``link''function
applied to an unknown one-dimensional projection of the input. These models are …

Gradient descent provably optimizes over-parameterized neural networks

SS Du, X Zhai, B Poczos, A Singh - arxiv preprint arxiv:1810.02054, 2018 - arxiv.org
One of the mysteries in the success of neural networks is randomly initialized first order
methods like gradient descent can achieve zero training loss even though the objective …

Unbalanced minibatch optimal transport; applications to domain adaptation

K Fatras, T Séjourné, R Flamary… - … on Machine Learning, 2021 - proceedings.mlr.press
Optimal transport distances have found many applications in machine learning for their
capacity to compare non-parametric probability distributions. Yet their algorithmic complexity …

Gradient descent maximizes the margin of homogeneous neural networks

K Lyu, J Li - arxiv preprint arxiv:1906.05890, 2019 - arxiv.org
In this paper, we study the implicit regularization of the gradient descent algorithm in
homogeneous neural networks, including fully-connected and convolutional neural …

Global optimality guarantees for policy gradient methods

J Bhandari, D Russo - Operations Research, 2024 - pubsonline.informs.org
Policy gradients methods apply to complex, poorly understood, control problems by
performing stochastic gradient descent over a parameterized class of polices. Unfortunately …

Gradient descent on two-layer nets: Margin maximization and simplicity bias

K Lyu, Z Li, R Wang, S Arora - Advances in Neural …, 2021 - proceedings.neurips.cc
The generalization mystery of overparametrized deep nets has motivated efforts to
understand how gradient descent (GD) converges to low-loss solutions that generalize well …

Gradient-free methods for deterministic and stochastic nonsmooth nonconvex optimization

T Lin, Z Zheng, M Jordan - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Nonsmooth nonconvex optimization problems broadly emerge in machine learning and
business decision making, whereas two core challenges impede the development of …

Algorithmic regularization in learning deep homogeneous models: Layers are automatically balanced

SS Du, W Hu, JD Lee - Advances in neural information …, 2018 - proceedings.neurips.cc
We study the implicit regularization imposed by gradient descent for learning multi-layer
homogeneous functions including feed-forward fully connected and convolutional deep …