Quantum variational algorithms are swamped with traps
One of the most important properties of classical neural networks is how surprisingly trainable they are, though their training algorithms typically rely on optimizing complicated …
FL-NTK: A neural tangent kernel-based framework for federated learning analysis
Federated Learning (FL) is an emerging learning scheme that allows different distributed clients to train deep neural networks together without data sharing. Neural networks have …
When deep learning meets polyhedral theory: A survey
In the past decade, deep learning became the prevalent methodology for predictive modeling thanks to the remarkable accuracy of deep neural networks in tasks such as …
Provably learning a multi-head attention layer
The multi-head attention layer is one of the key components of the transformer architecture that sets it apart from traditional feed-forward models. Given a sequence length $k$ …
Towards lower bounds on the depth of ReLU neural networks
We contribute to a better understanding of the class of functions that is represented by a neural network with ReLU activations and a given architecture. Using techniques from mixed …
Bounding the width of neural networks via coupled initialization: a worst case analysis
A common method in training neural networks is to initialize all the weights to be independent Gaussian vectors. We observe that by instead initializing the weights into …
Hardness of noise-free learning for two-hidden-layer neural networks
We give superpolynomial statistical query (SQ) lower bounds for learning two-hidden-layer ReLU networks with respect to Gaussian inputs in the standard (noise-free) model. No …
Training Fully Connected Neural Networks is ∃ℝ-Complete
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully connected neural network to fit a given set of data points, also known as empirical …
Learning narrow one-hidden-layer ReLU networks
We consider the well-studied problem of learning a linear combination of $k$ ReLU activations with respect to a Gaussian distribution on inputs in $d$ dimensions. We give the …
Agnostically learning multi-index models with queries
We study the power of query access for the fundamental task of agnostic learning under the Gaussian distribution. In the agnostic model, no assumptions are made on the labels of the …