[PDF][PDF] Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review

M Uehara, Y Zhao, C Wang, X Li, A Regev… - arxiv preprint arxiv …, 2025 - ai-plans.com
This tutorial provides an in-depth guide on inference-time guidance and alignment methods
for optimizing downstream reward functions in diffusion models. While diffusion models are …

Steering masked discrete diffusion models via discrete denoising posterior prediction

J Rector-Brooks, M Hasan, Z Peng, Z Quinn… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative modeling of discrete data underlies important applications spanning text-based
agents like ChatGPT to the design of the very building blocks of life in protein sequences …

Sequential controlled langevin diffusions

J Chen, L Richter, J Berner, D Blessing… - arxiv preprint arxiv …, 2024 - arxiv.org
An effective approach for sampling from unnormalized densities is based on the idea of
gradually transporting samples from an easy prior to the complicated target distribution. Two …

From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

J Berner, L Richter, M Sendera, J Rector-Brooks… - arxiv preprint arxiv …, 2025 - arxiv.org
We study the problem of training neural stochastic differential equations, or diffusion models,
to sample from a Boltzmann distribution without access to target samples. Existing methods …

Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review

M Uehara, Y Zhao, C Wang, X Li, A Regev… - arxiv preprint arxiv …, 2025 - arxiv.org
This tutorial provides an in-depth guide on inference-time guidance and alignment methods
for optimizing downstream reward functions in diffusion models. While diffusion models are …

Generative flows on synthetic pathway for drug design

S Seo, M Kim, T Shen, M Ester, J Park, S Ahn… - arxiv preprint arxiv …, 2024 - arxiv.org
Generative models in drug discovery have recently gained attention as efficient alternatives
to brute-force virtual screening. However, most existing models do not account for …

Can a Bayesian Oracle Prevent Harm from an Agent?

Y Bengio, MK Cohen, N Malkin, M MacDermott… - arxiv preprint arxiv …, 2024 - arxiv.org
Is there a way to design powerful AI systems based on machine learning methods that would
satisfy probabilistic safety guarantees? With the long-term goal of obtaining a probabilistic …

Gflownet pretraining with inexpensive rewards

M Pandey, G Subbaraj, E Bengio - arxiv preprint arxiv:2409.09702, 2024 - arxiv.org
Generative Flow Networks (GFlowNets), a class of generative models have recently
emerged as a suitable framework for generating diverse and high-quality molecular …

Adaptive teachers for amortized samplers

M Kim, S Choi, T Yun, E Bengio, L Feng… - arxiv preprint arxiv …, 2024 - arxiv.org
Amortized inference is the task of training a parametric model, such as a neural network, to
approximate a distribution with a given unnormalized density where exact sampling is …

Beyond squared error: Exploring loss design for enhanced training of generative flow networks

R Hu, Y Zhang, Z Li, L Huang - arxiv preprint arxiv:2410.02596, 2024 - arxiv.org
Generative Flow Networks (GFlowNets) are a novel class of generative models designed to
sample from unnormalized distributions and have found applications in various important …