Multi-agent reinforcement learning: A selective overview of theories and algorithms
Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …
has registered tremendous success in solving various sequential decision-making problems …
[HTML][HTML] Applications of reinforcement learning in energy systems
Energy systems undergo major transitions to facilitate the large-scale penetration of
renewable energy technologies and improve efficiencies, leading to the integration of many …
renewable energy technologies and improve efficiencies, leading to the integration of many …
Bilinear classes: A structural framework for provable generalization in rl
Abstract This work introduces Bilinear Classes, a new structural framework, which permit
generalization in reinforcement learning in a wide variety of settings through the use of …
generalization in reinforcement learning in a wide variety of settings through the use of …
Bellman eluder dimension: New rich classes of rl problems, and sample-efficient algorithms
Finding the minimal structural assumptions that empower sample-efficient learning is one of
the most important research directions in Reinforcement Learning (RL). This paper …
the most important research directions in Reinforcement Learning (RL). This paper …
The statistical complexity of interactive decision making
A fundamental challenge in interactive learning and decision making, ranging from bandit
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …
problems to reinforcement learning, is to provide sample-efficient, adaptive learning …
Nearly minimax optimal reinforcement learning for linear mixture markov decision processes
We study reinforcement learning (RL) with linear function approximation where the
underlying transition probability kernel of the Markov decision process (MDP) is a linear …
underlying transition probability kernel of the Markov decision process (MDP) is a linear …
A review of cooperative multi-agent deep reinforcement learning
Abstract Deep Reinforcement Learning has made significant progress in multi-agent
systems in recent years. The aim of this review article is to provide an overview of recent …
systems in recent years. The aim of this review article is to provide an overview of recent …
Model-based reinforcement learning with value-targeted regression
This paper studies model-based reinforcement learning (RL) for regret minimization. We
focus on finite-horizon episodic RL where the transition model $ P $ belongs to a known …
focus on finite-horizon episodic RL where the transition model $ P $ belongs to a known …
Policy finetuning: Bridging sample-efficient offline and online reinforcement learning
Recent theoretical work studies sample-efficient reinforcement learning (RL) extensively in
two settings: learning interactively in the environment (online RL), or learning from an offline …
two settings: learning interactively in the environment (online RL), or learning from an offline …
Information-theoretic considerations in batch reinforcement learning
Value-function approximation methods that operate in batch mode have foundational
importance to reinforcement learning (RL). Finite sample guarantees for these methods …
importance to reinforcement learning (RL). Finite sample guarantees for these methods …