The curious price of distributional robustness in reinforcement learning with a generative model

L Shi, G Li, Y Wei, Y Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …

Leveraging factored action spaces for efficient offline reinforcement learning in healthcare

S Tang, M Makar, M Sjoding… - Advances in Neural …, 2022 - proceedings.neurips.cc
Many reinforcement learning (RL) applications have combinatorial action spaces, where
each action is a composition of sub-actions. A standard RL approach ignores this inherent …

Event-centric temporal knowledge graph construction: A survey

T Knez, S Žitnik - Mathematics, 2023 - mdpi.com
Textual documents serve as representations of discussions on a variety of subjects. These
discussions can vary in length and may encompass a range of events or factual information …

An effective negotiating agent framework based on deep offline reinforcement learning

S Chen, J Zhao, G Weiss, R Su… - Uncertainty in Artificial …, 2023 - proceedings.mlr.press
Learning is crucial for automated negotiation, and recent years have witnessed a
remarkable achievement in application of reinforcement learning (RL) for various …

Provably efficient risk-sensitive reinforcement learning: Iterated cvar and worst path

Y Du, S Wang, L Huang - arxiv preprint arxiv:2206.02678, 2022 - arxiv.org
In this paper, we study a novel episodic risk-sensitive Reinforcement Learning (RL) problem,
named Iterated CVaR RL, which aims to maximize the tail of the reward-to-go at each step …

Continuous-Time decision transformer for healthcare applications

Z Zhang, H Mei, Y Xu - International Conference on Artificial …, 2023 - proceedings.mlr.press
Offline reinforcement learning (RL) is a promising approach for training intelligent medical
agents to learn treatment policies and assist decision making in many healthcare …

Learning general world models in a handful of reward-free deployments

Y Xu, J Parker-Holder, A Pacchiano… - Advances in …, 2022 - proceedings.neurips.cc
Building generally capable agents is a grand challenge for deep reinforcement learning
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …

Connected and automated vehicles in mixed-traffic: Learning human driver behavior for effective on-ramp merging

N Venkatesh, VA Le, A Dave… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
Highway merging scenarios featuring mixed traffic conditions pose significant modeling and
control challenges for connected and automated vehicles (CAVs) interacting with incoming …

Deep offline reinforcement learning for real-world treatment optimization applications

M Nambiar, S Ghosh, P Ong, YE Chan… - Proceedings of the 29th …, 2023 - dl.acm.org
There is increasing interest in data-driven approaches for recommending optimal treatment
strategies in many chronic disease management and critical care applications …

[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation

T Lam, A Verma, BKH Low, P Jaillet - The Eleventh International …, 2022 - drive.google.com
We study the risk-aware reinforcement learning (RL) problem in the episodic finite-horizon
Markov decision process with unknown transition and reward functions. In contrast to the risk …