| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| Tighter Theory for Local SGD on Identical and Heterogeneous Data | A Khaled, K Mishchenko, P Richtárik | AISTATS 2020 (arXiv:1909.04746) | 502 | 2020 |
| Better theory for SGD in the nonconvex world | A Khaled, P Richtárik | TMLR | 216 | 2020 |
| First analysis of local GD on heterogeneous data | A Khaled, K Mishchenko, P Richtárik | arXiv preprint arXiv:1909.04715 | 192 | 2019 |
| Random Reshuffling: Simple Analysis with Vast Improvements | K Mishchenko, A Khaled, P Richtárik | NeurIPS 2020 (arXiv:2006.05988) | 159 | 2020 |
| Unified Analysis of Stochastic Gradient Methods for Composite Convex and Smooth Optimization | A Khaled, O Sebbouh, N Loizou, RM Gower, P Richtárik | JOTA | 51 | 2020 |
| Proximal and federated random reshuffling | K Mishchenko, A Khaled, P Richtárik | International Conference on Machine Learning, 15718-15749 | 47 | 2022 |
| Better Communication Complexity for Local SGD | A Khaled, K Mishchenko, P Richtárik | arXiv preprint arXiv:1909.04746v1 | 33 | 2019 |
| FLIX: A Simple and Communication-Efficient Alternative to Local Methods in Federated Learning | E Gasanov, A Khaled, S Horváth, P Richtárik | AISTATS 2022 (arXiv:2111.11556) | 32 | 2021 |
| Gradient descent with compressed iterates | A Khaled, P Richtárik | arXiv preprint arXiv:1909.04716 | 30 | 2019 |
| Federated optimization algorithms with random reshuffling and gradient compression | A Sadiev, G Malinovsky, E Gorbunov, I Sokolov, A Khaled, K Burlachenko, ... | arXiv preprint arXiv:2206.07021 | 27 | 2022 |
| Distributed fixed point methods with compressed iterates | S Chraibi, A Khaled, D Kovalev, P Richtárik, A Salim, M Takáč | arXiv preprint arXiv:1912.09925 | 26 | 2019 |
| The road less scheduled | A Defazio, XA Yang, H Mehta, K Mishchenko, A Khaled, A Cutkosky | arXiv preprint arXiv:2405.15682 | 22 | 2024 |
| DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method | A Khaled, K Mishchenko, C Jin | NeurIPS 2023 | 21 | 2023 |
| Faster federated optimization under second-order similarity | A Khaled, C Jin | ICLR 2023 | 18 | 2022 |
| Applying fast matrix multiplication to neural networks | A Khaled, AF Atiya, AH Abdel-Gawad | Proceedings of the 35th Annual ACM Symposium on Applied Computing, 1034-1037 | 11 | 2020 |
| Tuning-Free Stochastic Optimization | A Khaled, C Jin | ICML 2024 | 7 | 2024 |
| Directional Smoothness and Gradient Methods: Convergence and Adaptivity | A Mishkin, A Khaled, Y Wang, A Defazio, RM Gower | arXiv preprint arXiv:2403.04081 | 5 | 2024 |
| A novel analysis of gradient descent under directional smoothness | A Mishkin, A Khaled, A Defazio, RM Gower | OPT 2023: Optimization for Machine Learning | | |