A tighter complexity analysis of SparseGPT

X Li, Y Liang, Z Shi, Z Song - arxiv preprint arxiv:2408.12151, 2024 - arxiv.org
In this work, we improved the analysis of the running time of SparseGPT [Frantar, Alistarh
ICML 2023] from $O(d^{3})$ to $O(d^{\omega} + d^{2+a+o(1)} + d^{1+\omega(1,1,a)-a})$ …

Training overparametrized neural networks in sublinear time

Y Deng, H Hu, Z Song, O Weinstein, D Zhuo - arxiv preprint arxiv …, 2022 - arxiv.org
The success of deep learning comes at a tremendous computational and energy cost, and
the scalability of training massively overparametrized neural networks is becoming a real …

A quantum speed-up for approximating the top eigenvectors of a matrix

Y Chen, A Gilyén, R de Wolf - Proceedings of the 2025 Annual ACM-SIAM …, 2025 - SIAM
Finding a good approximation of the top eigenvector of a given $d \times d$ matrix $A$ is a basic and
important computational problem, with many applications. We give two different quantum …

Closing the Computational-Query Depth Gap in Parallel Stochastic Convex Optimization

A Jambulapati, A Sidford, K Tian - The Thirty Seventh Annual …, 2024 - proceedings.mlr.press
We develop a new parallel algorithm for minimizing Lipschitz, convex functions with a
stochastic subgradient oracle. The total number of queries made and the query depth, i.e., the …

Invariant subspaces and PCA in nearly matrix multiplication time

A Sobczyk, M Mladenović, M Luisier - arxiv preprint arxiv:2311.10459, 2023 - arxiv.org
Approximating invariant subspaces of generalized eigenvalue problems (GEPs) is a
fundamental computational problem at the core of machine learning and scientific …

Faster Cycle Detection in the Congested Clique

K Censor-Hillel, T Even, VV Williams - arxiv preprint arxiv:2408.15132, 2024 - arxiv.org
We provide a fast distributed algorithm for detecting $h$-cycles in the \textsf{Congested
Clique} model, whose running time decreases as the number of $h$-cycles in the graph …

Computationally Faster Newton Methods by Lazy Evaluations

L Chen, C Liu, L Luo, J Zhang - arxiv preprint arxiv:2501.17488, 2025 - arxiv.org
This paper studies second-order optimization methods solving monotone nonlinear
equation problems (MNE) and minimization problems (Min) in a $d$-dimensional vector …

Faster Weighted and Unweighted Tree Edit Distance and APSP Equivalence

J Nogler, A Polak, B Saha, VV Williams, Y Xu… - arxiv preprint arxiv …, 2024 - arxiv.org
The tree edit distance (TED) between two rooted ordered trees with $n$ nodes labeled
from an alphabet $\Sigma$ is the minimum cost of transforming one tree into the other by a …

Deterministic complexity analysis of Hermitian eigenproblems

A Sobczyk - arxiv preprint arxiv:2410.21550, 2024 - arxiv.org
In this work we revisit the arithmetic and bit complexity of Hermitian eigenproblems. We first
provide an analysis for the divide-and-conquer tridiagonal eigensolver of Gu and Eisenstat …

On Incremental Approximate Shortest Paths in Directed Graphs

A Górkiewicz, A Karczmarz - arxiv preprint arxiv:2502.10348, 2025 - arxiv.org
In this paper, we show new data structures maintaining approximate shortest paths in sparse
directed graphs with polynomially bounded non-negative edge weights under edge …