On the convergence of fedavg on non-iid data
Federated learning enables a large amount of edge computing devices to jointly learn a
model without data sharing. As a leading algorithm in this setting, Federated Averaging …
model without data sharing. As a leading algorithm in this setting, Federated Averaging …
Gradient sparsification for communication-efficient distributed optimization
Modern large-scale machine learning applications require stochastic optimization
algorithms to be implemented on distributed computational architectures. A key bottleneck is …
algorithms to be implemented on distributed computational architectures. A key bottleneck is …
A review of distributed statistical inference
The rapid emergence of massive datasets in various fields poses a serious challenge to
traditional statistical methods. Meanwhile, it provides opportunities for researchers to …
traditional statistical methods. Meanwhile, it provides opportunities for researchers to …
[BOOK][B] Statistical foundations of data science
Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …
statistical models, contemporary statistical machine learning techniques and algorithms …
[HTML][HTML] Distributed testing and estimation under sparse high dimensional models
This paper studies hypothesis testing and parameter estimation in the context of the divide-
and-conquer algorithm. In a unified likelihood based framework, we propose new test …
and-conquer algorithm. In a unified likelihood based framework, we propose new test …
Distributed Computing and Inference for Big Data
L Zhou, Z Gong, P **ang - Annual Review of Statistics and Its …, 2023 - annualreviews.org
Data are distributed across different sites due to computing facility limitations or data privacy
considerations. Conventional centralized methods—those in which all datasets are stored …
considerations. Conventional centralized methods—those in which all datasets are stored …
Quantile regression under memory constraint
Quantile regression under memory constraint Page 1 The Annals of Statistics 2019, Vol. 47,
No. 6, 3244–3273 https://doi.org/10.1214/18-AOS1777 © Institute of Mathematical Statistics …
No. 6, 3244–3273 https://doi.org/10.1214/18-AOS1777 © Institute of Mathematical Statistics …
[HTML][HTML] Distributed estimation of principal eigenspaces
Principal component analysis (PCA) is fundamental to statistical machine learning. It extracts
latent principal factors that contribute to the most variation of the data. When data are stored …
latent principal factors that contribute to the most variation of the data. When data are stored …
Inference for multiple heterogeneous networks with a common invariant subspace
The development of models and methodology for the analysis of data from multiple
heterogeneous networks is of importance both in statistical network theory and across a …
heterogeneous networks is of importance both in statistical network theory and across a …
Communication-efficient accurate statistical estimation
When the data are stored in a distributed manner, direct applications of traditional statistical
inference procedures are often prohibitive due to communication costs and privacy …
inference procedures are often prohibitive due to communication costs and privacy …