To compress or not to compress—self-supervised learning and information theory: A review

R Shwartz Ziv, Y LeCun - Entropy, 2024 - mdpi.com
Deep neural networks excel in supervised learning tasks but are constrained by the need for
extensive labeled data. Self-supervised learning emerges as a promising alternative …

Generalization bounds: Perspectives from information theory and PAC-Bayes

F Hellström, G Durisi, B Guedj… - … and Trends® in …, 2025 - nowpublishers.com
A fundamental question in theoretical machine learning is generalization. Over the past
decades, the PAC-Bayesian approach has been established as a flexible framework to …

Control batch size and learning rate to generalize well: Theoretical and empirical evidence

F He, T Liu, D Tao - Advances in neural information …, 2019 - proceedings.neurips.cc
Deep neural networks have achieved dramatic success with the optimization method of
stochastic gradient descent (SGD). However, it is still not clear how to tune hyper …

Recent advances in deep learning theory

F He, D Tao - arXiv preprint arXiv:2012.10931, 2020 - arxiv.org
Deep learning is usually described as an experiment-driven field under continual criticism
for lacking theoretical foundations. This problem has been partially addressed by a large volume of …

On the power of over-parametrization in neural networks with quadratic activation

S Du, J Lee - International conference on machine learning, 2018 - proceedings.mlr.press
We provide new theoretical insights into why over-parametrization is effective in learning
neural networks. For a $k$-hidden-node shallow network with quadratic activation and $n$ …

Tightening mutual information-based bounds on generalization error

Y Bu, S Zou, VV Veeravalli - IEEE Journal on Selected Areas in …, 2020 - ieeexplore.ieee.org
An information-theoretic upper bound on the generalization error of supervised learning
algorithms is derived. The bound is constructed in terms of the mutual information between …
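
For context (not part of the indexed abstract), the bound this line of work refines is usually stated, under a $\sigma$-sub-Gaussian loss assumption, in the full-sample form of Xu and Raginsky (2017) together with the per-sample tightening associated with Bu, Zou, and Veeravalli; a sketch of both forms:

$$ \bigl|\mathbb{E}[\mathrm{gen}(W, S)]\bigr| \;\le\; \sqrt{\frac{2\sigma^2}{n}\, I(W; S)} \qquad\text{and}\qquad \bigl|\mathbb{E}[\mathrm{gen}(W, S)]\bigr| \;\le\; \frac{1}{n}\sum_{i=1}^{n} \sqrt{2\sigma^2\, I(W; Z_i)}, $$

where $W$ is the learned hypothesis, $S = (Z_1, \dots, Z_n)$ the training sample, and $I(\cdot\,;\cdot)$ mutual information; exact constants and conditions should be checked against the paper itself.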

Information-theoretic generalization bounds for SGLD via data-dependent estimates

J Negrea, M Haghifam, GK Dziugaite… - Advances in …, 2019 - proceedings.neurips.cc
In this work, we improve upon the stepwise analysis of noisy iterative learning algorithms
initiated by Pensia, Jog, and Loh (2018) and recently extended by Bu, Zou, and Veeravalli …

Sharpened generalization bounds based on conditional mutual information and an application to noisy, iterative algorithms

M Haghifam, J Negrea, A Khisti… - Advances in …, 2020 - proceedings.neurips.cc
The information-theoretic framework of Russo and Zou (2016) and Xu and Raginsky (2017)
provides bounds on the generalization error of a learning algorithm in terms of the mutual …
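
For comparison (again a sketch, not quoted from the abstract), the conditional-mutual-information variant introduced by Steinke and Zakynthinou (2020), which this paper sharpens, replaces $I(W;S)$ with a conditional quantity and applies to losses bounded in $[0,1]$:

$$ \bigl|\mathbb{E}[\mathrm{gen}(W, S)]\bigr| \;\le\; \sqrt{\frac{2}{n}\, I\bigl(W; U \mid \tilde{Z}\bigr)}, $$

where $\tilde{Z}$ is a supersample of $2n$ points and $U$ the random indices selecting the $n$ training points from it; the precise statement should be verified against the original papers.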

Information-theoretic generalization bounds for stochastic gradient descent

G Neu, GK Dziugaite, M Haghifam… - … on Learning Theory, 2021 - proceedings.mlr.press
We study the generalization properties of the popular stochastic optimization method known
as stochastic gradient descent (SGD) for optimizing general non-convex loss functions. Our …

Topological generalization bounds for discrete-time stochastic optimization algorithms

R Andreeva, B Dupuis, R Sarkar… - Advances in Neural …, 2025 - proceedings.neurips.cc
We present a novel set of rigorous and computationally efficient topology-based complexity
notions that exhibit a strong correlation with the generalization gap in modern deep neural …