Optimal conservative offline rl with general function approximation via augmented lagrangian
Offline reinforcement learning (RL), which refers to decision-making from a previously-
collected dataset of interactions, has received significant attention over the past years. Much …
collected dataset of interactions, has received significant attention over the past years. Much …
An effective negotiating agent framework based on deep offline reinforcement learning
Learning is crucial for automated negotiation, and recent years have witnessed a
remarkable achievement in application of reinforcement learning (RL) for various …
remarkable achievement in application of reinforcement learning (RL) for various …
When demonstrations meet generative world models: A maximum likelihood framework for offline inverse reinforcement learning
Offline inverse reinforcement learning (Offline IRL) aims to recover the structure of rewards
and environment dynamics that underlie observed actions in a fixed, finite set of …
and environment dynamics that underlie observed actions in a fixed, finite set of …
Understanding expertise through demonstrations: A maximum likelihood framework for offline inverse reinforcement learning
Offline inverse reinforcement learning (Offline IRL) aims to recover the structure of rewards
and environment dynamics that underlie observed actions in a fixed, finite set of …
and environment dynamics that underlie observed actions in a fixed, finite set of …
SCORE: Simple Contrastive Representation and Reset-Ensemble for offline meta-reinforcement learning
H Yang, K Lin, T Yang, G Sun - Knowledge-Based Systems, 2025 - Elsevier
Offline meta-reinforcement learning (OMRL) aims to train agents to quickly adapt to new
tasks using only pre-collected data. However, existing OMRL methods often involve …
tasks using only pre-collected data. However, existing OMRL methods often involve …
A simple unified uncertainty-guided framework for offline-to-online reinforcement learning
Offline reinforcement learning (RL) provides a promising solution to learning an agent fully
relying on a data-driven paradigm. However, constrained by the limited quality of the offline …
relying on a data-driven paradigm. However, constrained by the limited quality of the offline …
Representation-driven reinforcement learning
We present a representation-driven framework for reinforcement learning. By representing
policies as estimates of their expected values, we leverage techniques from contextual …
policies as estimates of their expected values, we leverage techniques from contextual …
Delphic offline reinforcement learning under nonidentifiable hidden confounding
A prominent challenge of offline reinforcement learning (RL) is the issue of hidden
confounding: unobserved variables may influence both the actions taken by the agent and …
confounding: unobserved variables may influence both the actions taken by the agent and …
Sumo: Search-based uncertainty estimation for model-based offline reinforcement learning
The performance of offline reinforcement learning (RL) suffers from the limited size and
quality of static datasets. Model-based offline RL addresses this issue by generating …
quality of static datasets. Model-based offline RL addresses this issue by generating …
Embedding-Aligned Language Models
We propose a novel approach for training large language models (LLMs) to adhere to
objectives defined within a latent embedding space. Our method leverages reinforcement …
objectives defined within a latent embedding space. Our method leverages reinforcement …