On the implicit bias in deep-learning algorithms

G Vardi - Communications of the ACM, 2023 - dl.acm.org
Deep learning has been highly successful in recent years and has led to dramatic improvements in multiple domains …
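
As a concrete illustration of the phenomenon the survey studies (a minimal sketch, not code from the article; the data and step size here are made up), consider gradient descent on the logistic loss for a linear classifier over separable data:

    import numpy as np

    # On separable data the logistic loss has no finite minimizer: scaling
    # up w always decreases it. So ||w|| diverges, but the *direction*
    # w / ||w|| stabilizes toward the maximum-margin separator -- the
    # textbook example of an implicit bias of the optimizer.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 2))
    y = np.sign(X @ np.array([1.0, 0.5]))          # labels in {-1, +1}

    w = np.zeros(2)
    for step in range(1, 100001):
        margins = y * (X @ w)
        coef = np.exp(-np.logaddexp(0.0, margins))  # sigmoid(-margin), stable
        w += 0.1 * (X * (y * coef)[:, None]).mean(axis=0)
        if step in (100, 10000, 100000):
            print(step, round(np.linalg.norm(w), 2), w / np.linalg.norm(w))

The printed norm keeps growing while the printed direction barely moves; the survey's subject is what plays the role of this limiting direction in deep, nonlinear networks.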

Neural networks are convex regularizers: Exact polynomial-time convex optimization formulations for two-layer networks

M Pilanci, T Ergen - International Conference on Machine …, 2020 - proceedings.mlr.press
We develop exact representations of training two-layer neural networks with rectified linear
units (ReLUs) in terms of a single convex program with the number of variables polynomial in …
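
To convey the flavor of the construction (a hedged sketch, not the paper's code): for each activation pattern D_i = diag(1[X u_i >= 0]), the reformulation introduces a pair of gated linear neurons with a group-norm penalty playing the role of weight decay. The exact program enumerates all patterns; the cvxpy sketch below samples a random subset, so it solves a restriction of the true convex program:

    import numpy as np
    import cvxpy as cp

    rng = np.random.default_rng(0)
    n, d, P = 40, 3, 50                            # samples, dim, sampled patterns
    X = rng.normal(size=(n, d))
    y = np.maximum(X @ rng.normal(size=d), 0.0)    # targets from a planted ReLU
    beta = 1e-3                                    # weight-decay strength

    # Each random direction u induces a 0/1 gating pattern over the data.
    D = (X @ rng.normal(size=(d, P)) >= 0).astype(float)   # (n, P)

    V = cp.Variable((d, P))                        # "positive" neurons v_i
    W = cp.Variable((d, P))                        # "negative" neurons w_i
    pred = cp.sum(cp.multiply(D, X @ (V - W)), axis=1)
    cons = []
    for i in range(P):                             # (2 D_i - I) X v_i >= 0, etc.
        s = 2 * D[:, i:i+1] - 1
        cons += [cp.multiply(s, X @ V[:, i:i+1]) >= 0,
                 cp.multiply(s, X @ W[:, i:i+1]) >= 0]
    reg = cp.sum(cp.norm(V, 2, axis=0)) + cp.sum(cp.norm(W, 2, axis=0))
    prob = cp.Problem(cp.Minimize(0.5 * cp.sum_squares(pred - y) + beta * reg), cons)
    prob.solve()
    print("objective of the restricted convex program:", prob.value)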

Implicit regularization towards rank minimization in ReLU networks

N Timor, G Vardi, O Shamir - International Conference on …, 2023 - proceedings.mlr.press
We study the conjectured relationship between the implicit regularization in neural networks,
trained with gradient-based methods, and rank minimization of their weight matrices …
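
One way to probe this conjecture numerically (an illustrative sketch; the width, loss, and training budget are mine, not the paper's setting) is to train a ReLU network on separable data and inspect the singular-value decay of each weight matrix:

    import torch

    torch.manual_seed(0)
    X = torch.randn(256, 10)
    y = torch.sign(X[:, 0])                        # labels in {-1, +1}

    net = torch.nn.Sequential(
        torch.nn.Linear(10, 100, bias=False), torch.nn.ReLU(),
        torch.nn.Linear(100, 100, bias=False), torch.nn.ReLU(),
        torch.nn.Linear(100, 1, bias=False),
    )
    opt = torch.optim.SGD(net.parameters(), lr=0.05)
    for _ in range(5000):
        opt.zero_grad()
        # logistic loss log(1 + exp(-y f(x))), written stably via softplus
        loss = torch.nn.functional.softplus(-y * net(X).squeeze(-1)).mean()
        loss.backward()
        opt.step()

    for name, p in net.named_parameters():
        s = torch.linalg.svdvals(p.detach())
        print(name, "top singular values:", [round(float(t), 3) for t in s[:5]])

Whether a sharp rank drop shows up depends on the regime the paper analyzes; the snippet is only a diagnostic.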

On the effective number of linear regions in shallow univariate ReLU networks: Convergence guarantees and implicit bias

I Safran, G Vardi, JD Lee - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We study the dynamics and implicit bias of gradient flow (GF) on univariate ReLU neural
networks with a single hidden layer in a binary classification setting. We show that when the …
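
To make "linear regions" concrete (a sketch under my own toy parameters): a univariate one-hidden-layer ReLU network f(x) = sum_j v_j * relu(w_j x + b_j) is piecewise linear, with a potential breakpoint at x = -b_j / w_j for every neuron with w_j != 0, so its effective regions can be counted directly:

    import numpy as np

    rng = np.random.default_rng(0)
    width = 50
    w = rng.normal(size=width)
    b = rng.normal(size=width)
    v = rng.normal(size=width)

    active = w != 0
    breaks = -b[active] / w[active]
    print("upper bound:", len(np.unique(breaks)) + 1, "regions for width", width)

    # A breakpoint only counts if the slope actually changes there; the
    # slope jump contributed by neuron j is v_j * w_j, and neurons sharing
    # a breakpoint can cancel. Group breakpoints and sum their jumps.
    total = {}
    for p, j in zip(np.round(breaks, 10), v[active] * w[active]):
        total[p] = total.get(p, 0.0) + j
    effective = sum(abs(j) > 1e-10 for j in total.values())
    print("effective regions:", effective + 1)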

Revealing the structure of deep neural networks via convex duality

T Ergen, M Pilanci - International Conference on Machine …, 2021 - proceedings.mlr.press
We study regularized deep neural networks (DNNs) and introduce a convex analytic
framework to characterize the structure of the hidden layers. We show that a set of optimal …

Learning a neuron by a shallow ReLU network: Dynamics and implicit bias for correlated inputs

D Chistikov, M Englert, R Lazic - Advances in Neural …, 2023 - proceedings.neurips.cc
We prove that, for the fundamental regression task of learning a single neuron, training a
one-hidden-layer ReLU network of any width by gradient flow from a small initialisation …
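
The setting can be simulated directly (a sketch: gradient flow is discretized here as small-step gradient descent, and the dimensions, width, and initialization scale are illustrative):

    import numpy as np

    rng = np.random.default_rng(0)
    d, width, n = 5, 20, 100
    u = np.eye(d)[0]                               # planted single-neuron teacher
    X = rng.normal(size=(n, d))
    y = np.maximum(X @ u, 0.0)

    W = 1e-3 * rng.normal(size=(width, d))         # small initialization
    v = 1e-3 * rng.normal(size=width)

    lr = 1e-2                                      # small step ~ gradient flow
    for _ in range(100000):
        H = np.maximum(X @ W.T, 0.0)               # (n, width) activations
        r = H @ v - y                              # residuals
        v -= lr * (H.T @ r) / n
        W -= lr * ((r[:, None] * (H > 0)) * v).T @ X / n
    print("final squared loss:", 0.5 * np.mean(r ** 2))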

Global optimality beyond two layers: Training deep ReLU networks via convex programs

T Ergen, M Pilanci - International Conference on Machine …, 2021 - proceedings.mlr.press
Understanding the fundamental mechanism behind the success of deep neural networks is
one of the key challenges in the modern machine learning literature. Despite numerous …

How do minimum-norm shallow denoisers look in function space?

C Zeno, G Ongie, Y Blumenfeld… - Advances in …, 2023 - proceedings.neurips.cc
Neural network (NN) denoisers are an essential building block in many common tasks,
ranging from image reconstruction to image generation. However, the success of these …

Noisy interpolation learning with shallow univariate ReLU networks

N Joshi, G Vardi, N Srebro - arXiv preprint arXiv:2307.15396, 2023 - arxiv.org
Understanding how overparameterized neural networks generalize despite perfect
interpolation of noisy training data is a fundamental question. Mallinar et al. (2022) noted that …
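
The basic experiment behind this question looks roughly as follows (an illustrative sketch; the optimizer, noise level, and sizes are mine): fit noisy one-dimensional data to near-interpolation with a wide shallow ReLU network, then compare the resulting test risk with the Bayes risk fixed by the label noise:

    import torch

    torch.manual_seed(0)
    n, width, sigma = 30, 2000, 0.3
    x = torch.rand(n, 1) * 2 - 1
    y = torch.sin(3 * x) + sigma * torch.randn(n, 1)   # noisy targets

    net = torch.nn.Sequential(torch.nn.Linear(1, width), torch.nn.ReLU(),
                              torch.nn.Linear(width, 1))
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for _ in range(20000):
        opt.zero_grad()
        loss = ((net(x) - y) ** 2).mean()
        loss.backward()
        opt.step()

    xt = torch.linspace(-1, 1, 500).unsqueeze(1)
    with torch.no_grad():
        risk = ((net(xt) - torch.sin(3 * xt)) ** 2).mean() + sigma ** 2
    print("train MSE:", float(loss), "| test risk:", float(risk),
          "| Bayes risk:", sigma ** 2)

Whether the gap between test risk and Bayes risk stays bounded ("tempered" overfitting) or blows up is exactly what the paper analyzes.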

On margin maximization in linear and ReLU networks

G Vardi, O Shamir, N Srebro - Advances in Neural …, 2022 - proceedings.neurips.cc
The implicit bias of neural networks has been extensively studied in recent years. Lyu and Li
(2019) showed that in homogeneous networks trained with the exponential or the logistic …
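
For context, the truncated sentence refers to Lyu and Li's theorem that, for positively homogeneous networks trained with the exponential or logistic loss, gradient flow converges in direction to a KKT point of the margin-maximization problem

    \min_{\theta} \; \tfrac{1}{2}\,\lVert \theta \rVert_2^2
    \quad \text{s.t.} \quad y_i \, f(\theta; x_i) \ge 1 \quad \text{for all } i,

where f(\theta; \cdot) is the network. The paper's question is when such KKT points are in fact (local or global) margin maximizers.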