Deep clustering: A comprehensive survey

Y Ren, J Pu, Z Yang, J Xu, G Li, X Pu… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Cluster analysis plays an indispensable role in machine learning and data mining. Learning
a good data representation is crucial for clustering algorithms. Recently, deep clustering …

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

L Alzubaidi, J Bai, A Al-Sabaawi, J Santamaría… - Journal of Big Data, 2023 - Springer
Data scarcity is a major challenge when training deep learning (DL) models. DL demands a
large amount of data to achieve exceptional performance. Unfortunately, many applications …

Text-to-image diffusion models in generative ai: A survey

C Zhang, C Zhang, M Zhang, IS Kweon - arxiv preprint arxiv:2303.07909, 2023 - arxiv.org
This survey reviews text-to-image diffusion models in the context that diffusion models have
emerged to be popular for a wide range of generative tasks. As a self-contained work, this …

Protein design with guided discrete diffusion

N Gruver, S Stanton, N Frey… - Advances in neural …, 2023 - proceedings.neurips.cc
A popular approach to protein design is to combine a generative model with a discriminative
model for conditional sampling. The generative model samples plausible sequences while …

Vector quantized diffusion model for text-to-image synthesis

S Gu, D Chen, J Bao, F Wen, B Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present the vector quantized diffusion (VQ-Diffusion) model for text-to-image generation.
This method is based on a vector quantized variational autoencoder (VQ-VAE) whose latent …

Zero-shot text-guided object generation with dream fields

A Jain, B Mildenhall, JT Barron… - Proceedings of the …, 2022 - openaccess.thecvf.com
We combine neural rendering with multi-modal image and text representations to synthesize
diverse 3D objects solely from natural language descriptions. Our method, Dream Fields …

Diffusion models as plug-and-play priors

A Graikos, N Malkin, N Jojic… - Advances in Neural …, 2022 - proceedings.neurips.cc
We consider the problem of inferring high-dimensional data $ x $ in a model that consists of
a prior $ p (x) $ and an auxiliary differentiable constraint $ c (x, y) $ on $ x $ given some …

Ilvr: Conditioning method for denoising diffusion probabilistic models

J Choi, S Kim, Y Jeong, Y Gwon, S Yoon - arxiv preprint arxiv:2108.02938, 2021 - arxiv.org
Denoising diffusion probabilistic models (DDPM) have shown remarkable performance in
unconditional image generation. However, due to the stochasticity of the generative process …

A latent space of stochastic diffusion models for zero-shot image editing and guidance

CH Wu, F De la Torre - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Diffusion models generate images by iterative denoising. Recent work has shown that by
making the denoising process deterministic, one can encode real images into latent codes …