Graph convolutional networks: a comprehensive review
Graphs naturally appear in numerous application domains, ranging from social analysis,
bioinformatics to computer vision. The unique capability of graphs enables capturing the …
bioinformatics to computer vision. The unique capability of graphs enables capturing the …
Generative adversarial networks for face generation: A survey
Recently, generative adversarial networks (GANs) have progressed enormously, which
makes them able to learn complex data distributions in particular faces. More and more …
makes them able to learn complex data distributions in particular faces. More and more …
Towards universal fake image detectors that generalize across generative models
With generative models proliferating at a rapid rate, there is a growing need for general
purpose fake image detectors. In this work, we first show that the existing paradigm, which …
purpose fake image detectors. In this work, we first show that the existing paradigm, which …
Dense text-to-image generation with attention modulation
Existing text-to-image diffusion models struggle to synthesize realistic images given dense
captions, where each text prompt provides a detailed description for a specific image region …
captions, where each text prompt provides a detailed description for a specific image region …
Diffusionclip: Text-guided diffusion models for robust image manipulation
Recently, GAN inversion methods combined with Contrastive Language-Image Pretraining
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …
Sdedit: Guided image synthesis and editing with stochastic differential equations
Guided image synthesis enables everyday users to create and edit photo-realistic images
with minimum effort. The key challenge is balancing faithfulness to the user input (eg, hand …
with minimum effort. The key challenge is balancing faithfulness to the user input (eg, hand …
Taming transformers for high-resolution image synthesis
Designed to learn long-range interactions on sequential data, transformers continue to show
state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no …
state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no …
Peco: Perceptual codebook for bert pre-training of vision transformers
This paper explores a better prediction target for BERT pre-training of vision transformers.
We observe that current prediction targets disagree with human perception judgment. This …
We observe that current prediction targets disagree with human perception judgment. This …
De-fake: Detection and attribution of fake images generated by text-to-image generation models
Text-to-image generation models that generate images based on prompt descriptions have
attracted an increasing amount of attention during the past few months. Despite their …
attracted an increasing amount of attention during the past few months. Despite their …
Contrastive learning for unpaired image-to-image translation
In image-to-image translation, each patch in the output should reflect the content of the
corresponding patch in the input, independent of domain. We propose a straightforward …
corresponding patch in the input, independent of domain. We propose a straightforward …