[PDF][PDF] Tensor decompositions for learning latent variable models.

A Anandkumar, R Ge, DJ Hsu, SM Kakade… - J. Mach. Learn. Res …, 2014 - jmlr.org
This work considers a computationally and statistically efficient parameter estimation method
for a wide class of latent variable models—including Gaussian mixture models, hidden …

When is partially observable reinforcement learning not scary?

Q Liu, A Chung, C Szepesvári… - Conference on Learning …, 2022 - proceedings.mlr.press
Partial observability is ubiquitous in applications of Reinforcement Learning (RL), in which
agents learn to make a sequence of decisions despite lacking complete information about …

Robust estimators in high-dimensions without the computational intractability

I Diakonikolas, G Kamath, D Kane, J Li, A Moitra… - SIAM Journal on …, 2019 - SIAM
We study high-dimensional distribution learning in an agnostic setting where an adversary is
allowed to arbitrarily corrupt an ε-fraction of the samples. Such questions have a rich history …

Contrastive learning, multi-view redundancy, and linear models

C Tosh, A Krishnamurthy, D Hsu - Algorithmic Learning …, 2021 - proceedings.mlr.press
Self-supervised learning is an empirically successful approach to unsupervised learning
based on creating artificial supervised learning problems. A popular self-supervised …

Tensor attention training: Provably efficient learning of higher-order transformers

Y Liang, Z Shi, Z Song, Y Zhou - arxiv preprint arxiv:2405.16411, 2024 - arxiv.org
Tensor Attention, a multi-view attention that is able to capture high-order correlations among
multiple modalities, can overcome the representational limitations of classical matrix …

Near-optimal reinforcement learning with self-play

Y Bai, C **, T Yu - Advances in neural information …, 2020 - proceedings.neurips.cc
This paper considers the problem of designing optimal algorithms for reinforcement learning
in two-player zero-sum games. We focus on self-play algorithms which learn the optimal …

[PDF][PDF] Multi-objective reinforcement learning using sets of pareto dominating policies

K Van Moffaert, A Nowé - The Journal of Machine Learning Research, 2014 - jmlr.org
Many real-world problems involve the optimization of multiple, possibly conflicting
objectives. Multi-objective reinforcement learning (MORL) is a generalization of standard …

A spectral algorithm for learning hidden Markov models

D Hsu, SM Kakade, T Zhang - Journal of Computer and System Sciences, 2012 - Elsevier
Hidden Markov Models (HMMs) are one of the most fundamental and widely used statistical
tools for modeling discrete time series. In general, learning HMMs from data is …

Statistical query lower bounds for robust estimation of high-dimensional gaussians and gaussian mixtures

I Diakonikolas, DM Kane… - 2017 IEEE 58th Annual …, 2017 - ieeexplore.ieee.org
We describe a general technique that yields the first Statistical Query lower bounds for a
range of fundamental high-dimensional learning problems involving Gaussian distributions …

A method of moments for mixture models and hidden Markov models

A Anandkumar, D Hsu… - Conference on learning …, 2012 - proceedings.mlr.press
Mixture models are a fundamental tool in applied statistics and machine learning for treating
data taken from multiple subpopulations. The current practice for estimating the parameters …