Learning and generalization in overparameterized neural networks, going beyond two layers
A mean field view of the landscape of two-layer neural networks
Multilayer neural networks are among the most powerful models in machine learning, yet the fundamental reasons for this success defy mathematical understanding. Learning a neural …
Theoretical insights into the optimization landscape of over-parameterized shallow neural networks
In this paper, we study the problem of learning a shallow artificial neural network that best fits a training data set. We study this problem in the over-parameterized regime where the …
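Concretely, the over-parameterized regime here means the hidden width far exceeds the number of training points. As a minimal sketch of that setting (an illustration of the regime, not this paper's analysis; the widths, step size, and synthetic data below are arbitrary choices), one can watch full-batch gradient descent drive a wide one-hidden-layer ReLU network toward interpolating random labels:

```python
import numpy as np

rng = np.random.default_rng(0)

# Tiny synthetic regression task: n points in d dimensions.
n, d, m = 20, 5, 200            # m >> n: over-parameterized width (illustrative)
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)      # arbitrary labels to interpolate

# One-hidden-layer ReLU network f(x) = a^T relu(W x).
W = rng.standard_normal((m, d)) / np.sqrt(d)
a = rng.standard_normal(m) / np.sqrt(m)

lr = 0.02                       # small enough for stable full-batch GD here
for step in range(2000):
    H = np.maximum(X @ W.T, 0.0)        # hidden activations, (n, m)
    pred = H @ a
    resid = (pred - y) / n              # gradient of 0.5 * mean squared error
    # Backpropagate through both layers.
    grad_a = H.T @ resid
    grad_H = np.outer(resid, a) * (H > 0)
    grad_W = grad_H.T @ X
    a -= lr * grad_a
    W -= lr * grad_W

print("final training MSE:",
      np.mean((np.maximum(X @ W.T, 0.0) @ a - y) ** 2))  # near zero
```

Because the width m far exceeds the sample size n, the hidden-feature matrix has full row rank almost surely and gradient descent can drive the training error to near zero; when and why this happens is what such landscape analyses formalize.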
A comparison of deep networks with ReLU activation function and linear spline-type methods
K Eckle, J Schmidt-Hieber - Neural Networks, 2019 - Elsevier
Deep neural networks (DNNs) generate much richer function spaces than shallow networks. Since the function spaces induced by shallow networks have several approximation …
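One concrete fact behind this comparison: with one-dimensional input, a one-hidden-layer ReLU network computes a continuous piecewise-linear function, i.e., a linear spline whose knots sit where individual units switch on. A small numerical check of that identity (illustrative only, not the paper's construction; the network sizes and interval are arbitrary):

```python
import numpy as np

# In 1-D, f(x) = sum_i a_i * relu(w_i * x + b_i) is continuous and
# piecewise linear, with knots at x = -b_i / w_i where units switch on.
rng = np.random.default_rng(1)
k = 5
w = rng.standard_normal(k)
b = rng.standard_normal(k)
a = rng.standard_normal(k)

def relu_net(x):
    return np.maximum(np.outer(x, w) + b, 0.0) @ a

knots = np.sort(-b / w)
knots = knots[(knots > -3) & (knots < 3)]   # keep knots inside the test interval

# A linear spline through the knot values reproduces the network exactly.
x = np.linspace(-3.0, 3.0, 2001)
pts = np.concatenate(([-3.0], knots, [3.0]))
spline = np.interp(x, pts, relu_net(pts))
print("max |ReLU net - linear spline|:", np.max(np.abs(relu_net(x) - spline)))
```

The printed gap is at machine precision, since both functions are linear between consecutive knots and agree at the knots themselves.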
Rademacher complexity for adversarially robust generalization
Many machine learning models are vulnerable to adversarial attacks; for example, adding adversarial perturbations that are imperceptible to humans can often make machine …
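A classic instance of this phenomenon, useful for fixing ideas, is the fast gradient sign method of Goodfellow et al. (a standard attack, not this paper's contribution): a per-coordinate perturbation of size eps, aligned against the classifier, can flip a prediction even though each coordinate moves only slightly. The NumPy sketch below shows this for a fixed linear classifier; the dimensions and budget are arbitrary illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(2)
d = 1000

# A fixed linear classifier: predict sign(w . x).
w = rng.standard_normal(d)

# A correctly classified example with label y = +1.
x = 0.1 * rng.standard_normal(d) + 2.0 * w / np.linalg.norm(w)
y = 1.0

# FGSM-style perturbation: move each coordinate by eps in the direction
# that increases the loss; for a linear model that is -y * sign(w).
eps = 0.1                             # small per-coordinate budget
x_adv = x - eps * y * np.sign(w)

print("clean margin     :", y * (w @ x))       # positive: correct
print("perturbed margin :", y * (w @ x_adv))   # negative here: prediction flips
print("max |perturbation|:", np.max(np.abs(x_adv - x)))
```

The margin drops by eps times the l1 norm of w, which in high dimension dwarfs the per-coordinate budget; bounding generalization under exactly this kind of l-infinity threat model is what the Rademacher analysis addresses.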
Simple recurrent units for highly parallelizable recurrence
Common recurrent neural architectures scale poorly due to the intrinsic difficulty in parallelizing their state computations. In this work, we propose the Simple Recurrent Unit …
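The parallelization idea can be made concrete: in an SRU-style cell the heavy matrix multiplications depend only on the inputs, so they can be computed for all time steps at once, leaving only a cheap element-wise scan in the sequential loop. Below is a simplified sketch of that structure; it omits the actual SRU's highway/reset gate, so treat it as an assumption-laden caricature rather than the published architecture.

```python
import numpy as np

def sru_like(X, Wc, Wf, bf):
    """Simplified SRU-style layer. X has shape (T, d_in).
    Heavy matmuls are time-parallel; the recurrence is element-wise."""
    # Step 1: all matrix multiplications at once (parallel across time).
    C_tilde = X @ Wc.T                              # candidate states, (T, d)
    F = 1.0 / (1.0 + np.exp(-(X @ Wf.T + bf)))      # forget gates, (T, d)
    # Step 2: element-wise scan over time (no matmul inside the loop).
    c = np.zeros(Wc.shape[0])
    states = []
    for t in range(X.shape[0]):
        c = F[t] * c + (1.0 - F[t]) * C_tilde[t]
        states.append(c)
    return np.stack(states)

T, d_in, d = 8, 4, 6
rng = np.random.default_rng(3)
X = rng.standard_normal((T, d_in))
H = sru_like(X, rng.standard_normal((d, d_in)),
             rng.standard_normal((d, d_in)), np.zeros(d))
print(H.shape)   # (8, 6)
```

By contrast, an LSTM's gates depend on the previous hidden state, so its matrix multiplications cannot be hoisted out of the time loop; that is the scaling difficulty the abstract refers to.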
Beyond sparsity: Tree regularization of deep models for interpretability
The lack of interpretability remains a key barrier to the adoption of deep models in many applications. In this work, we explicitly regularize deep models so human users might step …
Recovery guarantees for one-hidden-layer neural networks
In this paper, we consider regression problems with one-hidden-layer neural networks (1NNs). We distill some properties of activation functions that lead to local strong convexity …
Learning one-hidden-layer neural networks with landscape design
We consider the problem of learning a one-hidden-layer neural network: we assume the input $x \in \mathbb{R}^d$ is drawn from a Gaussian distribution and the label $y = a^\top\sigma$ …
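The label model is cut off in the snippet; for concreteness, here is how one might generate data from a one-hidden-layer teacher of the general form the abstract suggests. The completion $y = a^\top \sigma(Bx)$, the ReLU choice for $\sigma$, and all dimensions below are assumptions for illustration, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(4)
d, k, n = 10, 4, 500        # input dim, hidden width, sample size (arbitrary)

# Hypothetical ground-truth ("teacher") parameters -- the paper's exact
# label model is truncated in the snippet; B, a, and ReLU are assumptions.
B = rng.standard_normal((k, d))
a = rng.standard_normal(k)

X = rng.standard_normal((n, d))            # x ~ N(0, I_d), as in the abstract
y = np.maximum(X @ B.T, 0.0) @ a           # y = a^T sigma(B x), sigma = ReLU

print(X.shape, y.shape)                    # (500, 10) (500,)
```

Recovering $a$ and $B$ from such samples is the learning problem whose optimization landscape the paper designs and analyzes.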
What can ResNet learn efficiently, going beyond kernels?
How can neural networks such as ResNet \emph{efficiently} learn CIFAR-10 with test accuracy more than $96\%$, while other methods, especially kernel methods, fall relatively …