The elements of differentiable programming

M Blondel, V Roulet - arxiv preprint arxiv:2403.14606, 2024 - arxiv.org
Artificial intelligence has recently experienced remarkable advances, fueled by large
models, vast datasets, accelerated hardware, and, last but not least, the transformative …

Dpm-solver-v3: Improved diffusion ode solver with empirical model statistics

K Zheng, C Lu, J Chen, J Zhu - Advances in Neural …, 2023 - proceedings.neurips.cc
Diffusion probabilistic models (DPMs) have exhibited excellent performance for high-fidelity
image generation while suffering from inefficient sampling. Recent works accelerate the …

Uncertainty and resolution analysis of 2D and 3D inversion models computed from geophysical electromagnetic data

Z Ren, T Kalscheuer - Surveys in Geophysics, 2020 - Springer
A meaningful solution to an inversion problem should be composed of the preferred
inversion model and its uncertainty and resolution estimates. The model uncertainty …

Adahessian: An adaptive second order optimizer for machine learning

Z Yao, A Gholami, S Shen, M Mustafa… - proceedings of the …, 2021 - ojs.aaai.org
Incorporating second-order curvature information into machine learning optimization
algorithms can be subtle, and doing so naïvely can lead to high per-iteration costs …

Pruning convolutional neural networks for resource efficient inference

P Molchanov, S Tyree, T Karras, T Aila… - arxiv preprint arxiv …, 2016 - arxiv.org
We propose a new formulation for pruning convolutional kernels in neural networks to
enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by …

Adaptive second order coresets for data-efficient machine learning

O Pooladzandi, D Davini… - … on Machine Learning, 2022 - proceedings.mlr.press
Training machine learning models on massive datasets incurs substantial computational
costs. To alleviate such costs, there has been a sustained effort to develop data-efficient …

Equilibrated adaptive learning rates for non-convex optimization

Y Dauphin, H De Vries… - Advances in neural …, 2015 - proceedings.neurips.cc
Parameter-specific adaptive learning rate methods are computationally efficient ways to
reduce the ill-conditioning problems encountered when training large deep networks …

Randomized algorithms for matrices and data

MW Mahoney - Foundations and Trends® in Machine …, 2011 - nowpublishers.com
Randomized algorithms for very large matrix problems have received a great deal of
attention in recent years. Much of this work was motivated by problems in large-scale data …

[KNIHA][B] Numerical linear algebra

LN Trefethen, D Bau - 2022 - SIAM
Since the early 1980. the first author has taught a graduate course in numerical linear
algebra at MIT and Cornell. The alumni of this course, now numbering in the hundreds, have …

[KNIHA][B] Parameter estimation and inverse problems

RC Aster, B Borchers, CH Thurber - 2018 - books.google.com
Parameter Estimation and Inverse Problems, Third Edition, is structured around a course at
New Mexico Tech and is designed to be accessible to typical graduate students in the …