Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
A survey of meta-reinforcement learning
While deep reinforcement learning (RL) has fueled multiple high-profile successes in
machine learning, it is held back from more widespread adoption by its often poor data …
machine learning, it is held back from more widespread adoption by its often poor data …
Efficient reinforcement learning in block mdps: A model-free representation learning approach
We present BRIEE, an algorithm for efficient reinforcement learning in Markov Decision
Processes with block-structured dynamics (ie, Block MDPs), where rich observations are …
Processes with block-structured dynamics (ie, Block MDPs), where rich observations are …
Bayesian decision-making under misspecified priors with applications to meta-learning
M Simchowitz, C Tosh… - Advances in neural …, 2021 - proceedings.neurips.cc
Thompson sampling and other Bayesian sequential decision-making algorithms are among
the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The …
the most popular approaches to tackle explore/exploit trade-offs in (contextual) bandits. The …
Meta-thompson sampling
Efficient exploration in bandits is a fundamental online learning problem. We propose a
variant of Thompson sampling that learns to explore better as it interacts with bandit …
variant of Thompson sampling that learns to explore better as it interacts with bandit …
Offline multi-task transfer rl with representational penalization
We study the problem of representation transfer in offline Reinforcement Learning (RL),
where a learner has access to episodic data from a number of source tasks collected a …
where a learner has access to episodic data from a number of source tasks collected a …
Provable benefits of representational transfer in reinforcement learning
We study the problem of representational transfer in RL, where an agent first pretrains in a
number of\emph {source tasks} to discover a shared representation, which is subsequently …
number of\emph {source tasks} to discover a shared representation, which is subsequently …
Hierarchical bayesian bandits
Abstract Meta-, multi-task, and federated learning can be all viewed as solving similar tasks,
drawn from a distribution that reflects task similarities. We provide a unified view of all these …
drawn from a distribution that reflects task similarities. We provide a unified view of all these …
Provable benefit of multitask representation learning in reinforcement learning
As representation learning becomes a powerful technique to reduce sample complexity in
reinforcement learning (RL) in practice, theoretical understanding of its advantage is still …
reinforcement learning (RL) in practice, theoretical understanding of its advantage is still …
Probabilistic design of optimal sequential decision-making algorithms in learning and control
This survey is focused on certain sequential decision-making problems that involve
optimizing over probability functions. We discuss the relevance of these problems for …
optimizing over probability functions. We discuss the relevance of these problems for …
No regrets for learning the prior in bandits
Abstract We propose AdaTS, a Thompson sampling algorithm that adapts sequentially to
bandit tasks that it interacts with. The key idea in AdaTS is to adapt to an unknown task prior …
bandit tasks that it interacts with. The key idea in AdaTS is to adapt to an unknown task prior …