Tensor attention training: Provably efficient learning of higher-order transformers

Y Liang, Z Shi, Z Song, Y Zhou - arxiv preprint arxiv:2405.16411, 2024 - arxiv.org
Tensor Attention, a multi-view attention that is able to capture high-order correlations among
multiple modalities, can overcome the representational limitations of classical matrix …

Agnostic estimation of mean and covariance

KA Lai, AB Rao, S Vempala - 2016 IEEE 57th Annual …, 2016 - ieeexplore.ieee.org
We consider the problem of estimating the mean and covariance of a distribution from iid
samples in the presence of a fraction of malicious noise. This is in contrast to much recent …

Robust moment estimation and improved clustering via sum of squares

PK Kothari, J Steinhardt, D Steurer - … of the 50th Annual ACM SIGACT …, 2018 - dl.acm.org
We develop efficient algorithms for estimating low-degree moments of unknown distributions
in the presence of adversarial outliers and design a new family of convex relaxations for k …

Dictionary learning and tensor decomposition via the sum-of-squares method

B Barak, JA Kelner, D Steurer - Proceedings of the forty-seventh annual …, 2015 - dl.acm.org
We give a new approach to the dictionary learning (also known as" sparse coding") problem
of recovering an unknown nxm matrix A (for m≥ n) from examples of the form y= Ax+ e …

Relative error tensor low rank approximation

Z Song, DP Woodruff, P Zhong - Proceedings of the Thirtieth Annual ACM …, 2019 - SIAM
We consider relative error low rank approximation of tensors with respect to the Frobenius
norm. Namely, given an order-q tensor A∊ ℝ∏ i= 1 q ni, output a rank-k tensor B for which …

Smoothed analysis of tensor decompositions

A Bhaskara, M Charikar, A Moitra… - Proceedings of the forty …, 2014 - dl.acm.org
Low rank decomposition of tensors is a powerful tool for learning generative models. The
uniqueness results that hold for tensors give them a significant advantage over matrices …

Reconstructing training data from model gradient, provably

Z Wang, J Lee, Q Lei - International Conference on Artificial …, 2023 - proceedings.mlr.press
Understanding when and how much a model gradient leaks information about the training
sample is an important question in privacy. In this paper, we present a surprising result …

On learning mixtures of well-separated gaussians

O Regev, A Vijayaraghavan - 2017 IEEE 58th Annual …, 2017 - ieeexplore.ieee.org
We consider the problem of efficiently learning mixtures of a large number of spherical
Gaussians, when the components of the mixture are well separated. In the most basic form …

Joint sensing, communication, and AI: A trifecta for resilient THz user experiences

C Chaccour, W Saad, M Debbah… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this paper a novel joint sensing, communication, and artificial intelligence (AI) framework
is proposed so as to optimize extended reality (XR) experiences over terahertz (THz) …

An algorithm for generic and low-rank specific identifiability of complex tensors

L Chiantini, G Ottaviani, N Vannieuwenhoven - SIAM Journal on Matrix …, 2014 - SIAM
We propose a new sufficient condition for verifying whether general rank-r complex tensors
of arbitrary order admit a unique decomposition as a linear combination of rank-1 tensors. A …