Bayesian flow networks

A Graves, RK Srivastava, T Atkinson… - arxiv preprint arxiv …, 2023 - arxiv.org
This paper introduces Bayesian Flow Networks (BFNs), a new class of generative model in
which the parameters of a set of independent distributions are modified with Bayesian …

Simple Guidance Mechanisms for Discrete Diffusion Models

Y Schiff, SS Sahoo, H Phung, G Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models for continuous data gained widespread adoption owing to their high quality
generation and control mechanisms. However, controllable diffusion on discrete data faces …

Generative Marginalization Models

S Liu, PJ Ramadge, RP Adams - arxiv preprint arxiv:2310.12920, 2023 - arxiv.org
We introduce marginalization models (MaMs), a new family of generative models for high-
dimensional discrete data. They offer scalable and flexible generative modeling with …

Probabilistic Neural Circuits

PZ Dos Martires - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
Probabilistic circuits (PCs) have gained prominence in recent years as a versatile framework
for discussing probabilistic models that support tractable queries and are yet expressive …

Mitigating Embedding Collapse in Diffusion Models for Categorical Data

B Nguyen, Y Takida, N Murata, T Uesaka… - arxiv preprint arxiv …, 2024 - arxiv.org
Latent diffusion models have enabled continuous-state diffusion models to handle a variety
of datasets, including categorical data. However, most methods rely on fixed pretrained …

Parallel Sampling via Counting

N Anari, R Gao, A Rubinstein - Proceedings of the 56th Annual ACM …, 2024 - dl.acm.org
We show how to use parallelization to speed up sampling from an arbitrary distribution µ on
a product space [q] n, given oracle access to counting queries: ℙ X∼ µ [XS= σ S] for any …

Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds

M Hirt, D Campolo, V Leong, JP Ortega - arxiv preprint arxiv:2309.00380, 2023 - arxiv.org
Devising deep latent variable models for multi-modal data has been a long-standing theme
in machine learning research. Multi-modal Variational Autoencoders (VAEs) have been a …

Autoregressive Diffusion Models with non-Uniform Generation Order

FE Kelvinius, F Lindsten - ICML 2023 Workshop on Structured …, 2023 - openreview.net
Diffusion models for discrete data have gained increasing interest lately. Recent methods
use an autoregressive formulation, but where the generation order is random. In this work …

Simplified and Generalized Masked Diffusion for Discrete Data

J Shi, K Han, Z Wang, A Doucet, MK Titsias - arxiv preprint arxiv …, 2024 - arxiv.org
Masked (or absorbing) diffusion is actively explored as an alternative to autoregressive
models for generative modeling of discrete data. However, existing work in this area has …

Probabilistic neural circuits

PZD Martires - arxiv preprint arxiv:2403.06235, 2024 - arxiv.org
Probabilistic circuits (PCs) have gained prominence in recent years as a versatile framework
for discussing probabilistic models that support tractable queries and are yet expressive …