Diffusion models, image super-resolution, and everything: A survey

BB Moser, AS Shanbhag, F Raue… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Diffusion models (DMs) have disrupted the image super-resolution (SR) field and further
closed the gap between image quality and human perceptual preferences. They are easy to …

Explainable generative ai (genxai): A survey, conceptualization, and research agenda

J Schneider - Artificial Intelligence Review, 2024 - Springer
Generative AI (GenAI) represents a shift from AI's ability to “recognize” to its ability to
“generate” solutions for a wide range of tasks. As generated solutions and applications grow …

Pix2video: Video editing using image diffusion

D Ceylan, CHP Huang, NJ Mitra - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Image diffusion models, trained on massive image collections, have emerged as the most
versatile image generator model in terms of quality and diversity. They support inverting real …

Diffusion self-guidance for controllable image generation

D Epstein, A Jabri, B Poole, A Efros… - Advances in Neural …, 2023 - proceedings.neurips.cc
Large-scale generative models are capable of producing high-quality images from detailed
prompts. However, many aspects of an image are difficult or impossible to convey through …

Text-to-image diffusion models in generative ai: A survey

C Zhang, C Zhang, M Zhang, IS Kweon - arxiv preprint arxiv:2303.07909, 2023 - arxiv.org
This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …

Spatext: Spatio-textual representation for controllable image generation

O Avrahami, T Hayes, O Gafni… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …

Sine: Single image editing with text-to-image diffusion models

Z Zhang, L Han, A Ghosh… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent works on diffusion models have demonstrated a strong capability for conditioning
image generation, eg, text-guided image synthesis. Such success inspires many efforts …

Multidiffusion: Fusing diffusion paths for controlled image generation

O Bar-Tal, L Yariv, Y Lipman, T Dekel - 2023 - openreview.net
Recent advances in text-to-image generation with diffusion models present transformative
capabilities in image quality. However, user controllability of the generated image, and fast …

Disentangled representation learning

X Wang, H Chen, Z Wu, W Zhu - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Disentangled Representation Learning (DRL) aims to learn a model capable of identifying
and disentangling the underlying factors hidden in the observable data in representation …

Matting anything

J Li, J Jain, H Shi - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
In this paper we propose the Matting Anything Model (MAM) an efficient and versatile
framework for estimating the alpha matte of any instance in an image with flexible and …