Deep clustering: A comprehensive survey
Cluster analysis plays an indispensable role in machine learning and data mining. Learning
a good data representation is crucial for clustering algorithms. Recently, deep clustering …
a good data representation is crucial for clustering algorithms. Recently, deep clustering …
Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications
Data scarcity is a major challenge when training deep learning (DL) models. DL demands a
large amount of data to achieve exceptional performance. Unfortunately, many applications …
large amount of data to achieve exceptional performance. Unfortunately, many applications …
Text-to-image diffusion models in generative ai: A survey
This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …
Protein design with guided discrete diffusion
A popular approach to protein design is to combine a generative model with a discriminative
model for conditional sampling. The generative model samples plausible sequences while …
model for conditional sampling. The generative model samples plausible sequences while …
Vector quantized diffusion model for text-to-image synthesis
We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation.
This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent …
This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent …
Zero-shot text-guided object generation with dream fields
We combine neural rendering with multi-modal image and text representations to synthesize
diverse 3D objects solely from natural language descriptions. Our method, Dream Fields …
diverse 3D objects solely from natural language descriptions. Our method, Dream Fields …
Diffusion models as plug-and-play priors
We consider the problem of inferring high-dimensional data $ x $ in a model that consists of
a prior $ p (x) $ and an auxiliary differentiable constraint $ c (x, y) $ on $ x $ given some …
a prior $ p (x) $ and an auxiliary differentiable constraint $ c (x, y) $ on $ x $ given some …
Ilvr: Conditioning method for denoising diffusion probabilistic models
Denoising diffusion probabilistic models (DDPM) have shown remarkable performance in
unconditional image generation. However, due to the stochasticity of the generative process …
unconditional image generation. However, due to the stochasticity of the generative process …
A latent space of stochastic diffusion models for zero-shot image editing and guidance
Diffusion models generate images by iterative denoising. Recent work has shown that by
making the denoising process deterministic, one can encode real images into latent codes …
making the denoising process deterministic, one can encode real images into latent codes …