Google Académico

L Shi, G Li, Y Wei, Y Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper investigates model robustness in reinforcement learning (RL) via the framework
of distributionally robust Markov decision processes (RMDPs). Despite recent efforts, the …

Guardar Citar Citado por 38 Artículos relacionados Las 10 versiones Versión en HTML

[Free GPT-4]

[PDF] neurips.cc

Leveraging factored action spaces for efficient offline reinforcement learning in healthcare

S Tang, M Makar, M Sjoding… - Advances in Neural …, 2022 - proceedings.neurips.cc

Many reinforcement learning (RL) applications have combinatorial action spaces, where
each action is a composition of sub-actions. A standard RL approach ignores this inherent …

Guardar Citar Citado por 40 Artículos relacionados Las 9 versiones Versión en HTML

[Free GPT-4]

[PDF] mdpi.com

Event-centric temporal knowledge graph construction: A survey

T Knez, S Žitnik - Mathematics, 2023 - mdpi.com

Textual documents serve as representations of discussions on a variety of subjects. These
discussions can vary in length and may encompass a range of events or factual information …

Guardar Citar Citado por 7 Artículos relacionados Las 5 versiones En caché

[Free GPT-4]

[PDF] mlr.press

An effective negotiating agent framework based on deep offline reinforcement learning

S Chen, J Zhao, G Weiss, R Su… - Uncertainty in Artificial …, 2023 - proceedings.mlr.press

Learning is crucial for automated negotiation, and recent years have witnessed a
remarkable achievement in application of reinforcement learning (RL) for various …

Guardar Citar Citado por 5 Artículos relacionados Las 6 versiones Búsqueda de bibliotecas Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Provably efficient risk-sensitive reinforcement learning: Iterated cvar and worst path

Y Du, S Wang, L Huang - arxiv preprint arxiv:2206.02678, 2022 - arxiv.org

In this paper, we study a novel episodic risk-sensitive Reinforcement Learning (RL) problem,
named Iterated CVaR RL, which aims to maximize the tail of the reward-to-go at each step …

Guardar Citar Citado por 27 Artículos relacionados Las 4 versiones Versión en HTML

[Free GPT-4]

[PDF] mlr.press

Continuous-Time decision transformer for healthcare applications

Z Zhang, H Mei, Y Xu - International Conference on Artificial …, 2023 - proceedings.mlr.press

Offline reinforcement learning (RL) is a promising approach for training intelligent medical
agents to learn treatment policies and assist decision making in many healthcare …

Guardar Citar Citado por 14 Artículos relacionados Las 7 versiones Versión en HTML

[Free GPT-4]

[PDF] neurips.cc

Learning general world models in a handful of reward-free deployments

Y Xu, J Parker-Holder, A Pacchiano… - Advances in …, 2022 - proceedings.neurips.cc

Building generally capable agents is a grand challenge for deep reinforcement learning
(RL). To approach this challenge practically, we outline two key desiderata: 1) to facilitate …

Guardar Citar Citado por 10 Artículos relacionados Las 7 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Connected and automated vehicles in mixed-traffic: Learning human driver behavior for effective on-ramp merging

N Venkatesh, VA Le, A Dave… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org

Highway merging scenarios featuring mixed traffic conditions pose significant modeling and
control challenges for connected and automated vehicles (CAVs) interacting with incoming …

Guardar Citar Citado por 16 Artículos relacionados Las 4 versiones

[Free GPT-4]

[PDF] acm.org

Deep offline reinforcement learning for real-world treatment optimization applications

M Nambiar, S Ghosh, P Ong, YE Chan… - Proceedings of the 29th …, 2023 - dl.acm.org

There is increasing interest in data-driven approaches for recommending optimal treatment
strategies in many chronic disease management and critical care applications …

Guardar Citar Citado por 15 Artículos relacionados Las 4 versiones

[Free GPT-4]

[PDF] google.com

[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation

T Lam, A Verma, BKH Low, P Jaillet - The Eleventh International …, 2022 - drive.google.com

We study the risk-aware reinforcement learning (RL) problem in the episodic finite-horizon
Markov decision process with unknown transition and reward functions. In contrast to the risk …

Guardar Citar Citado por 13 Artículos relacionados Las 4 versiones

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Medical dead-ends and learning to identify high-risk states and treatments

The curious price of distributional robustness in reinforcement learning with a generative model

Leveraging factored action spaces for efficient offline reinforcement learning in healthcare

Event-centric temporal knowledge graph construction: A survey

An effective negotiating agent framework based on deep offline reinforcement learning

Provably efficient risk-sensitive reinforcement learning: Iterated cvar and worst path

Continuous-Time decision transformer for healthcare applications

Learning general world models in a handful of reward-free deployments

Connected and automated vehicles in mixed-traffic: Learning human driver behavior for effective on-ramp merging

Deep offline reinforcement learning for real-world treatment optimization applications

[PDF][PDF] Risk-aware reinforcement learning with coherent risk measures and non-linear function approximation