Diffusion models: A comprehensive survey of methods and applications

L Yang, Z Zhang, Y Song, S Hong, R Xu, Y Zhao… - ACM Computing …, 2023 - dl.acm.org
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …

Face image quality assessment: A literature survey

T Schlett, C Rathgeb, O Henniger, J Galbally… - ACM Computing …, 2022 - dl.acm.org
The performance of face analysis and recognition systems depends on the quality of the
acquired face data, which is influenced by numerous factors. Automatically assessing the …

An image is worth one word: Personalizing text-to-image generation using textual inversion

R Gal, Y Alaluf, Y Atzmon, O Patashnik… - arXiv preprint arXiv …, 2022 - arxiv.org
Text-to-image models offer unprecedented freedom to guide creation through natural
language. Yet, it is unclear how such freedom can be exercised to generate images of …

Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user-provided concepts, embedding them into new scenes guided by natural …

Score-based generative modeling in latent space

A Vahdat, K Kreis, J Kautz - Advances in neural information …, 2021 - proceedings.neurips.cc
Score-based generative models (SGMs) have recently demonstrated impressive results in
terms of both sample quality and distribution coverage. However, they are usually applied …

HyperStyle: StyleGAN inversion with hypernetworks for real image editing

Y Alaluf, O Tov, R Mokady, R Gal… - Proceedings of the …, 2022 - openaccess.thecvf.com
The inversion of real images into StyleGAN's latent space is a well-studied problem.
Nevertheless, applying existing approaches to real-world scenarios remains an open …

Pivotal tuning for latent-based editing of real images

D Roich, R Mokady, AH Bermano… - ACM Transactions on …, 2022 - dl.acm.org
Recently, numerous facial editing techniques have been proposed that leverage the
generative power of a pretrained StyleGAN. To successfully edit an image this way, one …

Perception prioritized training of diffusion models

J Choi, J Lee, C Shin, S Kim, H Kim… - Proceedings of the …, 2022 - openaccess.thecvf.com
Diffusion models learn to restore noisy data, which is corrupted with different levels of noise,
by optimizing the weighted sum of the corresponding loss terms, i.e., denoising score …

Break-a-scene: Extracting multiple concepts from a single image

O Avrahami, K Aberman, O Fried, D Cohen-Or… - SIGGRAPH Asia 2023 …, 2023 - dl.acm.org
Text-to-image model personalization aims to introduce a user-provided concept to the
model, allowing its synthesis in diverse contexts. However, current methods primarily focus …

Designing an encoder for StyleGAN image manipulation

O Tov, Y Alaluf, Y Nitzan, O Patashnik… - ACM Transactions on …, 2021 - dl.acm.org
Recently, there has been a surge of diverse methods for performing image editing by
employing pre-trained unconditional generators. Applying these methods on real images …