Convergence of entropy-regularized natural policy gradient with linear function approximation

S Cayci, N He, R Srikant - SIAM Journal on Optimization, 2024 - SIAM
Natural policy gradient (NPG) methods, equipped with function approximation and entropy
regularization, achieve impressive empirical success in reinforcement learning problems …

Information-theoretic generalization bounds for learning from quantum data

MC Caro, T Gur, C Rouzé, DS Franca… - The Thirty Seventh …, 2024 - proceedings.mlr.press
Learning tasks play an increasingly prominent role in quantum information and computation.
They range from fundamental problems such as state discrimination and metrology over the …

Information complexity of stochastic convex optimization: Applications to generalization and memorization

I Attias, GK Dziugaite, M Haghifam, R Livni… - arxiv preprint arxiv …, 2024 - arxiv.org
In this work, we investigate the interplay between memorization and learning in the context
of\emph {stochastic convex optimization}(SCO). We define memorization via the information …

Exactly tight information-theoretic generalization error bound for the quadratic gaussian problem

R Zhou, C Tian, T Liu - IEEE Journal on Selected Areas in …, 2024 - ieeexplore.ieee.org
We provide a new information-theoretic generalization error bound that is exactly tight (ie,
matching even the constant) for the canonical quadratic Gaussian (location) problem. Most …

An information-theoretic approach to generalization theory

B Rodríguez-Gálvez, R Thobaben… - arxiv preprint arxiv …, 2024 - arxiv.org
We investigate the in-distribution generalization of machine learning algorithms. We depart
from traditional complexity-based approaches by analyzing information-theoretic bounds …

Empirical risk minimization with relative entropy regularization

SM Perlaza, G Bisson, I Esnaola… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
The empirical risk minimization (ERM) problem with relative entropy regularization (ERM-
RER) is investigated under the assumption that the reference measure is a σ-finite measure …

Tighter generalisation bounds via interpolation

P Viallard, M Haddouche, U Şimşekli… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper contains a recipe for deriving new PAC-Bayes generalisation bounds based on
the $(f,\Gamma) $-divergence, and, in addition, presents PAC-Bayes generalisation bounds …

Information-Theoretic Generalization Bounds for Transductive Learning and its Applications

H Tang, Y Liu - arxiv preprint arxiv:2311.04561, 2023 - arxiv.org
In this paper, we develop data-dependent and algorithm-dependent generalization bounds
for transductive learning algorithms in the context of information theory for the first time. We …

Generalization and informativeness of conformal prediction

M Zecchin, S Park, O Simeone, F Hellström - arxiv preprint arxiv …, 2024 - arxiv.org
The safe integration of machine learning modules in decision-making processes hinges on
their ability to quantify uncertainty. A popular technique to achieve this goal is conformal …

Empirical risk minimization with relative entropy regularization type-II

F Daunas, I Esnaola, SM Perlaza, HV Poor - 2023 - hal.science
The effect of the relative entropy asymmetry is analyzed in the empirical risk minimization
with relative entropy regularization (ERM-RER) problem. A novel regularization is …