Communication-efficient distributed SGD with sketching

N Ivkin, D Rothchild, E Ullah… - Advances in Neural …, 2019 - proceedings.neurips.cc
Large-scale distributed training of neural networks is often limited by network bandwidth,
wherein the communication time overwhelms the local computation time. Motivated by the …

[PDF][PDF] Learning-Based Frequency Estimation Algorithms.

CY Hsu, P Indyk, D Katabi, A Vakilian - International Conference on …, 2019 - par.nsf.gov
Estimating the frequencies of elements in a data stream is a fundamental task in data
analysis and machine learning. The problem is typically addressed using streaming …

To petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics

RAL Elworth, Q Wang, PK Kota… - Nucleic acids …, 2020 - academic.oup.com
As computational biologists continue to be inundated by ever increasing amounts of
metagenomic data, the need for data analysis approaches that keep up with the pace of …

Improved frequency estimation algorithms with and without predictions

A Aamand, J Chen, H Nguyen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Estimating frequencies of elements appearing in a data stream is a key task in large-scale
data analysis. Popular sketching approaches to this problem (eg, CountMin and …

On the robustness of countsketch to adaptive inputs

E Cohen, X Lyu, J Nelson, T Sarlós… - International …, 2022 - proceedings.mlr.press
The last decade saw impressive progress towards understanding the performance of
algorithms in adaptive settings, where subsequent inputs may depend on the output from …

Asymptotics of the sketched pseudoinverse

D LeJeune, P Patil, H Javadi, RG Baraniuk… - SIAM Journal on …, 2024 - SIAM
We take a random matrix theory approach to random sketching and show an asymptotic first-
order equivalence of the regularized sketched pseudoinverse of a positive semidefinite …

Quick and robust feature selection: the strength of energy-efficient sparse training for autoencoders

Z Atashgahi, G Sokar, T van der Lee, E Mocanu… - Machine Learning, 2022 - Springer
Major complications arise from the recent increase in the amount of high-dimensional data,
including high computational costs and memory requirements. Feature selection, which …

Compressing gradient optimizers via count-sketches

R Spring, A Kyrillidis, V Mohan… - … on Machine Learning, 2019 - proceedings.mlr.press
Many popular first-order optimization methods accelerate the convergence rate of deep
learning models. However, these algorithms require auxiliary variables, which cost …

Tricking the hashing trick: A tight lower bound on the robustness of countsketch to adaptive inputs

E Cohen, J Nelson, T Sarlós, U Stemmer - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Abstract CountSketch and Feature Hashing (the``hashing trick'') are popular randomized
dimensionality reduction methods that support recovery of l2-heavy hitters and approximate …

Asymptotically free sketched ridge ensembles: Risks, cross-validation, and tuning

P Patil, D LeJeune - arxiv preprint arxiv:2310.04357, 2023 - arxiv.org
We employ random matrix theory to establish consistency of generalized cross validation
(GCV) for estimating prediction risks of sketched ridge regression ensembles, enabling …