Transformers in medical imaging: A survey

F Shamshad, S Khan, SW Zamir, MH Khan… - Medical Image …, 2023 - Elsevier
Following unprecedented success on natural language tasks, Transformers have been
successfully applied to several computer vision problems, achieving state-of-the-art results …

A review on generative adversarial networks for image generation

VLT De Souza, BAD Marques, HC Batagelo… - Computers & …, 2023 - Elsevier
Generative Adversarial Networks (GANs) are a type of deep learning architecture
that uses two networks, namely a generator and a discriminator, that, by competing against …

Plug-and-play diffusion features for text-driven image-to-image translation

N Tumanyan, M Geyer, S Bagon… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large-scale text-to-image generative models have been a revolutionary breakthrough in the
evolution of generative AI, synthesizing diverse images with highly complex visual concepts …

Imagic: Text-based real image editing with diffusion models

B Kawar, S Zada, O Lang, O Tov… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-conditioned image editing has recently attracted considerable interest. However, most
methods are currently limited to one of the following: specific editing types (e.g., object …

Diffusion self-guidance for controllable image generation

D Epstein, A Jabri, B Poole, A Efros… - Advances in Neural …, 2023 - proceedings.neurips.cc
Large-scale generative models are capable of producing high-quality images from detailed
prompts. However, many aspects of an image are difficult or impossible to convey through …

TokenFlow: Consistent diffusion features for consistent video editing

M Geyer, O Bar-Tal, S Bagon, T Dekel - arXiv preprint arXiv:2307.10373, 2023 - arxiv.org
The generative AI revolution has recently expanded to videos. Nevertheless, current
state-of-the-art video models are still lagging behind image models in terms of visual quality and …

Towards universal fake image detectors that generalize across generative models

U Ojha, Y Li, YJ Lee - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
With generative models proliferating at a rapid rate, there is a growing need for
general-purpose fake image detectors. In this work, we first show that the existing paradigm, which …

Text2LIVE: Text-driven layered image and video editing

O Bar-Tal, D Ofri-Amar, R Fridman, Y Kasten… - European conference on …, 2022 - Springer
We present a method for zero-shot, text-driven editing of natural images and videos. Given
an image or a video and a text prompt, our goal is to edit the appearance of existing objects …

One-step diffusion with distribution matching distillation

T Yin, M Gharbi, R Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models generate high-quality images but require dozens of forward passes. We
introduce Distribution Matching Distillation (DMD), a procedure to transform a diffusion model …

EGSDE: Unpaired image-to-image translation via energy-guided stochastic differential equations

M Zhao, F Bao, C Li, J Zhu - Advances in Neural …, 2022 - proceedings.neurips.cc
Score-based diffusion models (SBDMs) have achieved the SOTA FID results in unpaired
image-to-image translation (I2I). However, we notice that existing methods totally ignore the …