An online and unified algorithm for projection matrix vector multiplication with application to empirical risk minimization

L Qin, Z Song, L Zhang, D Zhuo - … Conference on Artificial …, 2023 - proceedings.mlr.press
Online matrix vector multiplication is a fundamental step and bottleneck in many machine
learning algorithms. It is defined as follows: given a matrix at the pre-processing phase, at …

Communication lower bounds for statistical estimation problems via a distributed data processing inequality

M Braverman, A Garg, T Ma, HL Nguyen… - Proceedings of the forty …, 2016 - dl.acm.org
We study the tradeoff between the statistical error and communication cost of distributed
statistical estimation problems in high dimensions. In the distributed sparse Gaussian mean …

Sketching for first order method: efficient algorithm for low-bandwidth channel and vulnerability

Z Song, Y Wang, Z Yu, L Zhang - … Conference on Machine …, 2023 - proceedings.mlr.press
Sketching is one of the most fundamental tools in large-scale machine learning. It enables
runtime and memory saving via randomly compressing the original large problem into lower …

Sketching meets differential privacy: fast algorithm for dynamic kronecker projection maintenance

Z Song, X Yang, Y Yang… - … Conference on Machine …, 2023 - proceedings.mlr.press
Projection maintenance is one of the core data structure tasks. Efficient data structures for
projection maintenance have led to recent breakthroughs in many convex programming …

Low rank approximation with entrywise l1-norm error

Z Song, DP Woodruff, P Zhong - Proceedings of the 49th Annual ACM …, 2017 - dl.acm.org
We study the ℓ1-low rank approximation problem, where for a given nxd matrix A and
approximation factor α≤ 1, the goal is to output a rank-k matrix  for which‖ A-Â‖ 1≤ α …

Relative error tensor low rank approximation

Z Song, DP Woodruff, P Zhong - Proceedings of the Thirtieth Annual ACM …, 2019 - SIAM
We consider relative error low rank approximation of tensors with respect to the Frobenius
norm. Namely, given an order-q tensor A∊ ℝ∏ i= 1 q ni, output a rank-k tensor B for which …

Optimal principal component analysis in distributed and streaming models

C Boutsidis, DP Woodruff, P Zhong - … of the forty-eighth annual ACM …, 2016 - dl.acm.org
This paper studies the Principal Component Analysis (PCA) problem in the distributed and
streaming models of computation. Given a matrix A∈ R m× n, a rank parameter k< rank (A) …

Is solving graph neural tangent kernel equivalent to training graph neural network?

L Qin, Z Song, B Sun - arxiv preprint arxiv:2309.07452, 2023 - arxiv.org
A rising trend in theoretical deep learning is to understand why deep learning works through
Neural Tangent Kernel (NTK)[jgh18], a kernel method that is equivalent to using gradient …

Dynamic tensor product regression

A Reddy, Z Song, L Zhang - Advances in Neural …, 2022 - proceedings.neurips.cc
In this work, we initiate the study of\emph {Dynamic Tensor Product Regression}. One has
matrices $ A_1\in\mathbb {R}^{n_1\times d_1},\ldots, A_q\in\mathbb {R}^{n_q\times d_q} …