- Academic Search

J Kwon, Y Efroni, C Caramanis… - Advances in Neural …, 2021 - proceedings.neurips.cc

In this work, we consider the regret minimization problem for reinforcement learning in latent
Markov Decision Processes (LMDP). In an LMDP, an MDP is randomly drawn from a set of …

Save Cite Cited by 86 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Learning mixtures of linear dynamical systems

Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press

We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from
unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …

Save Cite Cited by 23 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] neurips.cc

Provably efficient multi-task reinforcement learning with model transfer

C Zhang, Z Wang - Advances in Neural Information …, 2021 - proceedings.neurips.cc

We study multi-task reinforcement learning (RL) in tabular episodic Markov decision
processes (MDPs). We formulate a heterogeneous multi-player RL problem, in which a …

Save Cite Cited by 18 Related articles All 11 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Reward-mixing mdps with few latent contexts are learnable

J Kwon, Y Efroni, C Caramanis… - … on Machine Learning, 2023 - proceedings.mlr.press

We consider episodic reinforcement learning in reward-mixing Markov decision processes
(RMMDPs): at the beginning of every episode nature randomly picks a latent reward model …

Save Cite Cited by 7 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Sequential transfer in reinforcement learning with a generative model

A Tirinzoni, R Poiani, M Restelli - … Conference on Machine …, 2020 - proceedings.mlr.press

We are interested in how to design reinforcement learning agents that provably reduce the
sample complexity for learning new tasks by transferring knowledge from previously-solved …

Save Cite Cited by 26 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] aaai.org

Temple: Learning template of transitions for sample efficient multi-task rl

Y Sun, X Yin, F Huang - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org

Transferring knowledge among various environments is important for efficiently learning
multiple tasks online. Most existing methods directly use the previously learned models or …

Save Cite Cited by 24 Related articles All 9 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mlr.press

Horizon-free and variance-dependent reinforcement learning for latent markov decision processes

R Zhou, R Wang, SS Du - International Conference on …, 2023 - proceedings.mlr.press

We study regret minimization for reinforcement learning (RL) in Latent Markov Decision
Processes (LMDPs) with context in hindsight. We design a novel model-based algorithmic …

Save Cite Cited by 3 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Near-Optimal Learning and Planning in Separated Latent MDPs

F Chen, C Daskalakis, N Golowich… - arxiv preprint arxiv …, 2024 - arxiv.org

We study computational and statistical aspects of learning Latent Markov Decision
Processes (LMDPs). In this model, the learner interacts with an MDP drawn at the beginning …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Bayesian residual policy optimization:: Scalable bayesian reinforcement learning with clairvoyant experts

G Lee, B Hou, S Choudhury… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org

Informed and robust decision making in the face of uncertainty is critical for robots operating
in unstructured environments. We formulate this as Bayesian Reinforcement Learning over …

Save Cite Cited by 9 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] utexas.edu

Statistical learning with latent variables: mixture models and reinforcement learning

J Kwon - 2022 - repositories.lib.utexas.edu

Statistical learning with missing or hidden information is ubiquitous in many practical
problems. For example, the success of a certain medical treatment can largely depend on …

Save Cite Cited by 1 Related articles All 2 versions Free GPT-4 Library Search View as HTML

Create alert

Cite

Advanced search

Saved to My library

Pac continuous state online multitask reinforcement learning with identification

Rl for latent mdps: Regret guarantees and a lower bound

Learning mixtures of linear dynamical systems

Provably efficient multi-task reinforcement learning with model transfer

Reward-mixing mdps with few latent contexts are learnable

Sequential transfer in reinforcement learning with a generative model

Temple: Learning template of transitions for sample efficient multi-task rl

Horizon-free and variance-dependent reinforcement learning for latent markov decision processes

Near-Optimal Learning and Planning in Separated Latent MDPs

Bayesian residual policy optimization:: Scalable bayesian reinforcement learning with clairvoyant experts

Statistical learning with latent variables: mixture models and reinforcement learning