Distributed optimization with arbitrary local solvers

C Ma, J Konečný, M Jaggi, V Smith… - optimization Methods …, 2017 - Taylor & Francis
With the growth of data and necessity for distributed optimization methods, solvers that work
well on a single machine must be re-designed to leverage distributed computation. Recent …

Random-projection ensemble classification

TI Cannings, RJ Samworth - Journal of the Royal Statistical …, 2017 - academic.oup.com
We introduce a very general method for high dimensional classification, based on careful
combination of the results of applying an arbitrary base classifier to random projections of …

Adding vs. averaging in distributed primal-dual optimization

C Ma, V Smith, M Jaggi, M Jordan… - International …, 2015 - proceedings.mlr.press
Distributed optimization methods for large-scale machine learning suffer from a
communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and …

Random projections: Data perturbation for classification problems

TI Cannings - Wiley Interdisciplinary Reviews: Computational …, 2021 - Wiley Online Library
Random projections offer an appealing and flexible approach to a wide range of large‐scale
statistical problems. They are particularly useful in high‐dimensional settings, where we …

Random projections for large-scale regression

GA Thanei, C Heinze, N Meinshausen - Big and Complex Data Analysis …, 2017 - Springer
Fitting linear regression models can be computationally very expensive in large-scale data
analysis tasks if the sample size and the number of variables are very large. Random …

Stochastic, distributed and federated optimization for machine learning

J Konečný - arxiv preprint arxiv:1707.01155, 2017 - arxiv.org
We study optimization algorithms for the finite sum problems frequently arising in machine
learning applications. First, we propose novel variants of stochastic gradient descent with a …

Sketching meets random projection in the dual: A provable recovery algorithm for big and high-dimensional data

J Wang, J Lee, M Mahdavi, M Kolar… - Artificial Intelligence …, 2017 - proceedings.mlr.press
Sketching techniques scale up machine learning algorithms by reducing the sample size or
dimensionality of massive data sets, without sacrificing their statistical properties. In this …

Dual-loco: Distributing statistical estimation using random projections

C Heinze, B McWilliams… - Artificial intelligence …, 2016 - proceedings.mlr.press
We present DUAL-LOCO, a communication-efficient algorithm for distributed statistical
estimation. DUAL-LOCO assumes that the data is distributed across workers according to …

Federated learning of models pre-trained on different features with consensus graphs

T Ma, TN Hoang, J Chen - Uncertainty in Artificial …, 2023 - proceedings.mlr.press
Learning an effective global model on private and decentralized datasets has become an
increasingly important challenge of machine learning when applied in practice. Existing …

Variable selection using axis-aligned random projections for partial least-squares regression

Y Lin, X Zeng, P Wang, S Huang, KL Teo - Statistics and Computing, 2024 - Springer
In high-dimensional data modeling, variable selection plays a crucial role in improving
predictive accuracy and enhancing model interpretability through sparse representation …