Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Modulated contrast for versatile image synthesis

F Zhan, J Zhang, Y Yu, R Wu… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Perceiving the similarity between images has been a long-standing and fundamental
problem underlying various visual generation tasks. Predominant approaches measure the …

Prompt-free diffusion: Taking" text" out of text-to-image diffusion models

X Xu, J Guo, Z Wang, G Huang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Text-to-image (T2I) research has grown explosively in the past year owing to the
large-scale pre-trained diffusion models and many emerging personalization and editing …

Diverse image inpainting with bidirectional and autoregressive transformers

Y Yu, F Zhan, R Wu, J Pan, K Cui, S Lu, F Ma… - Proceedings of the 29th …, 2021 - dl.acm.org
Image inpainting is an underdetermined inverse problem, which naturally allows diverse
contents to fill up the missing or corrupted regions realistically. Prevalent approaches using …

Marginal contrastive correspondence for guided image generation

F Zhan, Y Yu, R Wu, J Zhang, S Lu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Exemplar-based image translation establishes dense correspondences between a
conditional input and an exemplar (from two different domains) for leveraging detailed …

Auto-regressive image synthesis with integrated quantization

F Zhan, Y Yu, R Wu, J Zhang, K Cui, C Zhang… - European Conference on …, 2022 - Springer
Deep generative models have achieved conspicuous progress in realistic image synthesis
with multifarious conditional inputs, while generating diverse yet high-fidelity images …

Emlight: Lighting estimation via spherical distribution approximation

F Zhan, C Zhang, Y Yu, Y Chang, S Lu, F Ma… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Illumination estimation from a single image is critical in 3D rendering and it has been
investigated extensively in the computer vision and computer graphic research community …

Wavefill: A wavelet-based generation network for image inpainting

Y Yu, F Zhan, S Lu, J Pan, F Ma… - Proceedings of the …, 2021 - openaccess.thecvf.com
Image inpainting aims to complete the missing or corrupted regions of images with realistic
contents. The prevalent approaches adopt a hybrid objective of reconstruction and …

Sparse needlets for lighting estimation with spherical transport loss

F Zhan, C Zhang, W Hu, S Lu, F Ma… - Proceedings of the …, 2021 - openaccess.thecvf.com
Accurate lighting estimation is challenging yet critical to many computer vision and computer
graphics tasks such as high-dynamic-range (HDR) relighting. Existing approaches model …

Dynast: Dynamic sparse transformer for exemplar-guided image generation

S Liu, J Ye, S Ren, X Wang - European Conference on Computer Vision, 2022 - Springer
One key challenge of exemplar-guided image generation lies in establishing fine-grained
correspondences between input and guided images. Prior approaches, despite the …