- Academic Search

M Ghavamzadeh, S Mannor, J Pineau… - … and Trends® in …, 2015 - nowpublishers.com

Bayesian methods for machine learning have been widely investigated, yielding principled
methods for incorporating prior information into inference algorithms. In this survey, we …

Salva Cita Citato da 593 Articoli correlati Tutte e 11 le versioni Ricerca biblioteche Versione HTML

[Free GPT-4]

[PDF] jmlr.org

A unified recipe for deriving (time-uniform) PAC-Bayes bounds

B Chugg, H Wang, A Ramdas - Journal of Machine Learning Research, 2023 - jmlr.org

We present a unified framework for deriving PAC-Bayesian generalization bounds. Unlike
most previous literature on this topic, our bounds are anytime-valid (ie, time-uniform) …

Salva Cita Citato da 43 Articoli correlati Tutte e 3 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Risk bounds for the majority vote: From a PAC-Bayesian analysis to a learning algorithm

P Germain, A Lacasse, F Laviolette… - ar** deep learning models with uncertainty estimates in the form of set-valued …

Salva Cita Citato da 6 Articoli correlati Tutte e 5 le versioni Versione HTML

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] Fast rates in statistical and online learning

T Van Erven, PD Grünwald, NA Mehta, MD Reid… - The Journal of Machine …, 2015 - jmlr.org

The speed with which a learning algorithm converges as it is presented with more data is a
central problem in machine learning—a fast rate of convergence means less data is needed …

Salva Cita Citato da 122 Articoli correlati Tutte e 15 le versioni Versione HTML

[Free GPT-4]

[PDF] jmlr.org

[PDF][PDF] Bayesian nonparametric covariance regression

EB Fox, DB Dunson - The Journal of Machine Learning Research, 2015 - jmlr.org

Capturing predictor-dependent correlations amongst the elements of a multivariate
response vector is fundamental to numerous applied domains, including neuroscience …

Salva Cita Citato da 95 Articoli correlati Tutte e 5 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

PAC-Bayesian lifelong learning for multi-armed bandits

H Flynn, D Reeb, M Kandemir, J Peters - Data Mining and Knowledge …, 2022 - Springer

We present a PAC-Bayesian analysis of lifelong learning. In the lifelong learning problem, a
sequence of learning tasks is observed one-at-a-time, and the goal is to transfer information …

Salva Cita Citato da 14 Articoli correlati Tutte e 7 le versioni

[Free GPT-4]

[PDF] arxiv.org

PAC-Bayesian soft actor-critic learning

B Tasdighi, A Akgül, M Haussmann, KK Brink… - arxiv preprint arxiv …, 2023 - arxiv.org

Actor-critic algorithms address the dual goals of reinforcement learning (RL), policy
evaluation and improvement via two separate function approximators. The practicality of this …

Salva Cita Citato da 5 Articoli correlati Tutte e 3 le versioni Versione HTML

[Free GPT-4]

[PDF] aclanthology.org

[PDF][PDF] Policy learning for domain selection in an extensible multi-domain spoken dialogue system

Z Wang, H Chen, G Wang, H Tian, H Wu… - Proceedings of the …, 2014 - aclanthology.org

This paper proposes a Markov Decision Process and reinforcement learning based
approach for domain selection in a multidomain Spoken Dialogue System built on a …

Salva Cita Citato da 46 Articoli correlati Tutte e 8 le versioni Versione HTML

[Free GPT-4]

[PDF] sagepub.com

PAC-Bayes control: learning policies that provably generalize to novel environments

A Majumdar, A Farid, A Sonar - The International Journal of …, 2021 - journals.sagepub.com

Our goal is to learn control policies for robots that provably generalize well to novel
environments given a dataset of example environments. The key technical idea behind our …

Salva Cita Citato da 37 Articoli correlati Tutte e 7 le versioni

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

PAC-Bayesian policy evaluation for reinforcement learning

Bayesian reinforcement learning: A survey

A unified recipe for deriving (time-uniform) PAC-Bayes bounds

Risk bounds for the majority vote: From a PAC-Bayesian analysis to a learning algorithm

[PDF][PDF] Fast rates in statistical and online learning

[PDF][PDF] Bayesian nonparametric covariance regression

PAC-Bayesian lifelong learning for multi-armed bandits

PAC-Bayesian soft actor-critic learning

[PDF][PDF] Policy learning for domain selection in an extensible multi-domain spoken dialogue system

PAC-Bayes control: learning policies that provably generalize to novel environments