Faster CryptoNets: Leveraging sparsity for real-world encrypted inference
Homomorphic encryption enables arbitrary computation over data while it remains
encrypted. This privacy-preserving feature is attractive for machine learning, but requires …
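As a rough illustration of the property the snippet describes (computation on data that stays encrypted), the sketch below evaluates a single linear layer on encrypted inputs with the python-paillier (phe) library, which is additively homomorphic. This is only my toy example; it is not the scheme, model, or sparsity technique used in the paper.

    # Illustration only: a plaintext-weight linear layer applied to encrypted inputs.
    # Paillier supports ciphertext + ciphertext and scalar * ciphertext, which is all
    # a linear layer needs.
    from phe import paillier

    public_key, private_key = paillier.generate_paillier_keypair(n_length=2048)

    x = [0.2, -1.3, 0.7]                  # client's private feature vector
    w, b = [0.5, 0.1, -0.4], 0.25         # server's plaintext weights and bias

    enc_x = [public_key.encrypt(v) for v in x]             # client encrypts and uploads
    enc_y = sum(wi * xi for wi, xi in zip(w, enc_x)) + b   # server computes w.x + b blindly
    print(private_key.decrypt(enc_y))                      # client decrypts, about -0.06

Nonlinearities are the hard part in practice, which is why CryptoNets-style networks replace activations such as ReLU with low-degree polynomials.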
The loss surface of deep and wide neural networks
While the optimization problem behind deep neural networks is highly non-convex, it is
frequently observed in practice that training deep networks seems possible without getting …
SafetyNets: Verifiable execution of deep neural networks on an untrusted cloud
Inference using deep neural networks is often outsourced to the cloud since it is a
computationally demanding task. However, this raises a fundamental issue of trust. How can …
The mechanism of prediction head in non-contrastive self-supervised learning
The surprising discovery of the BYOL method shows that negative samples can be replaced
by adding a prediction head to the network. It is mysterious why, even when there exist …
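To make the snippet's setup concrete, here is a minimal NumPy sketch of a BYOL-style forward pass: an online encoder followed by a prediction head is matched to a target encoder that receives no gradient and is updated only by an exponential moving average, with no negative samples anywhere. The linear encoders, shapes, and names are my own simplifications, not the paper's.

    # Toy BYOL-style step: predictor on top of the online encoder, target branch
    # treated as a constant (stop-gradient), no negative pairs.
    import numpy as np

    rng = np.random.default_rng(0)
    d, k = 32, 8
    W_online = rng.normal(size=(k, d))         # online encoder (linear, for brevity)
    W_pred   = rng.normal(size=(k, k))         # prediction head
    W_target = W_online.copy()                 # target encoder, updated only by EMA

    def normalize(v):
        return v / (np.linalg.norm(v) + 1e-8)

    x1, x2 = rng.normal(size=d), rng.normal(size=d)   # two augmented views of one input
    p = normalize(W_pred @ (W_online @ x1))           # online branch + prediction head
    z = normalize(W_target @ x2)                      # target branch (no gradient flows here)
    loss = -float(p @ z)                              # negative cosine similarity
    W_target = 0.99 * W_target + 0.01 * W_online      # EMA update of the target network
    print(loss)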
When is a convolutional filter easy to learn?
We analyze the convergence of the (stochastic) gradient descent algorithm for learning a
convolutional filter with the Rectified Linear Unit (ReLU) activation function. Our analysis does …
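The setting in the snippet can be mimicked with a tiny teacher-student experiment: plain gradient descent on the squared loss for a single ReLU filter applied to non-overlapping patches, with labels generated by a planted filter. The patch structure, step size, and initialization below are my own choices, not the ones analyzed in the paper.

    # Gradient descent trying to recover a planted ReLU filter w_star.
    import numpy as np

    rng = np.random.default_rng(1)
    k, P, n, lr = 5, 4, 2000, 0.05              # filter size, patches per input, samples, step
    w_star = rng.normal(size=k)                 # planted "teacher" filter
    Z = rng.normal(size=(n, P, k))              # n inputs, each split into P patches
    y = np.maximum(Z @ w_star, 0).mean(axis=1)  # labels: average-pooled ReLU responses

    w = rng.normal(size=k)                      # student filter
    for _ in range(500):
        pred = np.maximum(Z @ w, 0).mean(axis=1)
        # gradient of 0.5 * mean((pred - y)^2); ReLU subgradient at 0 taken as 0
        act = (Z @ w > 0).astype(float) / P
        grad = ((pred - y)[:, None, None] * act[:, :, None] * Z).sum(axis=(0, 1)) / n
        w -= lr * grad

    print(np.linalg.norm(w - w_star))           # small if gradient descent has converged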
On connected sublevel sets in deep learning
Q. Nguyen - International Conference on Machine Learning, 2019 - proceedings.mlr.press
This paper shows that every sublevel set of the loss function of a class of deep over-
parameterized neural nets with piecewise linear activation functions is connected and …
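For readers unfamiliar with the term, the snippet's statement concerns sublevel sets of the training loss; in my own notation (not necessarily the paper's), a claim of this shape reads

    \[
      \Omega_c \;=\; \{\, \theta \in \mathbb{R}^{p} : L(\theta) \le c \,\}
      \quad\text{is connected for every } c \in \mathbb{R} .
    \]

Connectedness of every sublevel set means the parameters achieving loss at most c always form a single piece of the landscape rather than several isolated basins.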
Optimization landscape and expressivity of deep CNNs
We analyze the loss landscape and expressiveness of practical deep convolutional neural
networks (CNNs) with shared weights and max pooling layers. We show that such CNNs …
On the loss landscape of a class of deep neural networks with no bad local valleys
We identify a class of over-parameterized deep neural networks with standard activation
functions and cross-entropy loss which provably have no bad local valley, in the sense that …
Adding one neuron can eliminate all bad local minima
One of the main difficulties in analyzing neural networks is the non-convexity of the loss
function which may have many bad local minima. In this paper, we study the landscape of …
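One well-known construction of this kind, which I sketch from memory and which may differ in details from the paper's exact formulation, augments the network output f(x; theta) with a single exponential unit and penalizes that unit's output weight:

    \[
      \tilde f(x;\theta,a,w,b) \;=\; f(x;\theta) + a\, e^{\,w^{\top} x + b},
      \qquad
      \tilde L(\theta,a,w,b) \;=\; L\bigl(\tilde f\bigr) + \lambda a^{2} .
    \]

Intuitively, the extra unit supplies a descent direction out of would-be spurious minima, while the regularizer drives a back to zero at the minima that survive.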
Understanding the loss surface of neural networks for binary classification
It is widely conjectured that training algorithms for neural networks are successful because
all local minima lead to similar performance; for example, see (LeCun et al., 2015; …