Object-conditioned energy-based attention map alignment in text-to-image diffusion models
Text-to-image diffusion models have shown great success in generating high-quality text-
guided images. Yet, these models may still fail to semantically align generated images with …
guided images. Yet, these models may still fail to semantically align generated images with …
Flow Matching: Markov Kernels, Stochastic Processes and Transport Plans
Among generative neural models, flow matching techniques stand out for their simple
applicability and good scaling properties. Here, velocity fields of curves connecting a simple …
applicability and good scaling properties. Here, velocity fields of curves connecting a simple …
Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Text-to-image (T2I) diffusion models have revolutionized generative modeling by producing
high-fidelity, diverse, and visually realistic images from textual prompts. Despite these …
high-fidelity, diverse, and visually realistic images from textual prompts. Despite these …