Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Communication-efficient distributed SGD with sketching
Large-scale distributed training of neural networks is often limited by network bandwidth,
wherein the communication time overwhelms the local computation time. Motivated by the …
wherein the communication time overwhelms the local computation time. Motivated by the …
[PDF][PDF] Learning-Based Frequency Estimation Algorithms.
Estimating the frequencies of elements in a data stream is a fundamental task in data
analysis and machine learning. The problem is typically addressed using streaming …
analysis and machine learning. The problem is typically addressed using streaming …
To petabytes and beyond: recent advances in probabilistic and signal processing algorithms and their application to metagenomics
As computational biologists continue to be inundated by ever increasing amounts of
metagenomic data, the need for data analysis approaches that keep up with the pace of …
metagenomic data, the need for data analysis approaches that keep up with the pace of …
Improved frequency estimation algorithms with and without predictions
Estimating frequencies of elements appearing in a data stream is a key task in large-scale
data analysis. Popular sketching approaches to this problem (eg, CountMin and …
data analysis. Popular sketching approaches to this problem (eg, CountMin and …
On the robustness of countsketch to adaptive inputs
The last decade saw impressive progress towards understanding the performance of
algorithms in adaptive settings, where subsequent inputs may depend on the output from …
algorithms in adaptive settings, where subsequent inputs may depend on the output from …
Asymptotics of the sketched pseudoinverse
We take a random matrix theory approach to random sketching and show an asymptotic first-
order equivalence of the regularized sketched pseudoinverse of a positive semidefinite …
order equivalence of the regularized sketched pseudoinverse of a positive semidefinite …
Quick and robust feature selection: the strength of energy-efficient sparse training for autoencoders
Major complications arise from the recent increase in the amount of high-dimensional data,
including high computational costs and memory requirements. Feature selection, which …
including high computational costs and memory requirements. Feature selection, which …
Compressing gradient optimizers via count-sketches
Many popular first-order optimization methods accelerate the convergence rate of deep
learning models. However, these algorithms require auxiliary variables, which cost …
learning models. However, these algorithms require auxiliary variables, which cost …
Tricking the hashing trick: A tight lower bound on the robustness of countsketch to adaptive inputs
Abstract CountSketch and Feature Hashing (the``hashing trick'') are popular randomized
dimensionality reduction methods that support recovery of l2-heavy hitters and approximate …
dimensionality reduction methods that support recovery of l2-heavy hitters and approximate …
Asymptotically free sketched ridge ensembles: Risks, cross-validation, and tuning
We employ random matrix theory to establish consistency of generalized cross validation
(GCV) for estimating prediction risks of sketched ridge regression ensembles, enabling …
(GCV) for estimating prediction risks of sketched ridge regression ensembles, enabling …