The curious price of distributional robustness in reinforcement learning with a generative model
This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …
Leveraging factored action spaces for efficient offline reinforcement learning in healthcare
Many reinforcement learning (RL) applications have combinatorial action spaces, where
each action is a composition of sub-actions. A standard RL approach ignores this inherent …
each action is a composition of sub-actions. A standard RL approach ignores this inherent …
Event-centric temporal knowledge graph construction: A survey
Textual documents serve as representations of discussions on a variety of subjects. These
discussions can vary in length and may encompass a range of events or factual information …
discussions can vary in length and may encompass a range of events or factual information …
An effective negotiating agent framework based on deep offline reinforcement learning
Learning is crucial for automated negotiation, and recent years have witnessed a
remarkable achievement in application of reinforcement learning (RL) for various …
remarkable achievement in application of reinforcement learning (RL) for various …
Provably efficient risk-sensitive reinforcement learning: Iterated cvar and worst path
In this paper, we study a novel episodic risk-sensitive Reinforcement Learning (RL) problem,
named Iterated CVaR RL, which aims to maximize the tail of the reward-to-go at each step …
named Iterated CVaR RL, which aims to maximize the tail of the reward-to-go at each step …
Continuous-Time decision transformer for healthcare applications
Offline reinforcement learning (RL) is a promising approach for training intelligent medical
agents to learn treatment policies and assist decision making in many healthcare …
agents to learn treatment policies and assist decision making in many healthcare …
Learning general world models in a handful of reward-free deployments
Building generally capable agents is a grand challenge for deep reinforcement learning
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …
Connected and automated vehicles in mixed-traffic: Learning human driver behavior for effective on-ramp merging
Highway merging scenarios featuring mixed traffic conditions pose significant modeling and
control challenges for connected and automated vehicles (CAVs) interacting with incoming …
control challenges for connected and automated vehicles (CAVs) interacting with incoming …
Deep offline reinforcement learning for real-world treatment optimization applications
There is increasing interest in data-driven approaches for recommending optimal treatment
strategies in many chronic disease management and critical care applications …
strategies in many chronic disease management and critical care applications …
[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation
We study the risk-aware reinforcement learning (RL) problem in the episodic finite-horizon
Markov decision process with unknown transition and reward functions. In contrast to the risk …
Markov decision process with unknown transition and reward functions. In contrast to the risk …