Deep generative models for offline policy learning: Tutorial, survey, and perspectives on future directions
Deep generative models (DGMs) have demonstrated great success across various domains,
particularly in generating texts, images, and videos using models trained from offline data …
particularly in generating texts, images, and videos using models trained from offline data …
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
While machine learning models are typically trained to solve prediction problems, we might
often want to use them for optimization problems. For example, given a dataset of proteins …
often want to use them for optimization problems. For example, given a dataset of proteins …
Latent energy-based odyssey: Black-box optimization via expanded exploration in the energy-based latent space
Offline Black-Box Optimization (BBO) aims at optimizing a black-box function using the
knowledge from a pre-collected offline dataset of function values and corresponding input …
knowledge from a pre-collected offline dataset of function values and corresponding input …
Latent Conservative Objective Models for Data-Driven Crystal Structure Prediction
In computational chemistry, crystal structure prediction (CSP) is an optimization problem that
involves discovering the lowest energy stable crystal structure for a given chemical formula …
involves discovering the lowest energy stable crystal structure for a given chemical formula …
Sharpness-Aware Black-Box Optimization
Black-box optimization algorithms have been widely used in various machine learning
problems, including reinforcement learning and prompt fine-tuning. However, directly …
problems, including reinforcement learning and prompt fine-tuning. However, directly …
Offline Model-Based Optimization by Learning to Rank
Offline model-based optimization (MBO) aims to identify a design that maximizes a black-
box function using only a fixed, pre-collected dataset of designs and their corresponding …
box function using only a fixed, pre-collected dataset of designs and their corresponding …
Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows
Despite notable successes of Reinforcement Learning (RL), the prevalent use of an online
learning paradigm prevents its widespread adoption, especially in hazardous or costly …
learning paradigm prevents its widespread adoption, especially in hazardous or costly …
Latent conservative objective models for offline data-driven crystal structure prediction
In computational chemistry, crystal structure prediction (CSP) is an optimization problem that
involves discovering the lowest energy stable crystal structure for a given chemical formula …
involves discovering the lowest energy stable crystal structure for a given chemical formula …
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Offline reinforcement learning algorithms often require careful hyperparameter tuning.
Consequently, before deployment, we need to select amongst a set of candidate policies. As …
Consequently, before deployment, we need to select amongst a set of candidate policies. As …
ROMO: Retrieval-enhanced Offline Model-based Optimization
M Chen, H Zhao, Y Zhao, H Fan, H Gao, Y Yu… - Proceedings of the Fifth …, 2023 - dl.acm.org
Data-driven black-box model-based optimization (MBO) problems arise in a great number of
practical application scenarios, where the goal is to find a design over the whole space …
practical application scenarios, where the goal is to find a design over the whole space …