Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

[HTML][HTML] Deep holography

G Situ - Light: Advanced Manufacturing, 2022 - light-am.com
With the explosive growth of mathematical optimization and computing hardware, deep
neural networks (DNN) have become tremendously powerful tools to solve many …

Generative diffusion prior for unified image restoration and enhancement

B Fei, Z Lyu, L Pan, J Zhang, W Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing image restoration methods mostly leverage the posterior distribution of natural
images. However, they often assume known degradation and also require supervised …

Collaborative diffusion for multi-modal face generation and editing

Z Huang, KCK Chan, Y Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Diffusion models arise as a powerful generative tool recently. Despite the great progress,
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …

Hierarchical fine-grained image forgery detection and localization

X Guo, X Liu, Z Ren, S Grosz… - Proceedings of the …, 2023 - openaccess.thecvf.com
Differences in forgery attributes of images generated in CNN-synthesized and image-editing
domains are large, and such differences make a unified image forgery detection and …

Headnerf: A real-time nerf-based parametric head model

Y Hong, B Peng, H **ao, L Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we propose HeadNeRF, a novel NeRF-based parametric head model that
integrates the neural radiance field to the parametric representation of the human head. It …

Gan inversion: A survey

W **a, Y Zhang, Y Yang, JH Xue… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …

Diffusion models already have a semantic latent space

M Kwon, J Jeong, Y Uh - arxiv preprint arxiv:2210.10960, 2022 - arxiv.org
Diffusion models achieve outstanding generative performance in various domains. Despite
their great success, they lack semantic latent space which is essential for controlling the …

Stylespace analysis: Disentangled controls for stylegan image generation

Z Wu, D Lischinski… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We explore and analyze the latent style space of StyleGAN2, a state-of-the-art architecture
for image generation, using models pretrained on several different datasets. We first show …

Ad-nerf: Audio driven neural radiance fields for talking head synthesis

Y Guo, K Chen, S Liang, YJ Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Generating high-fidelity talking head video by fitting with the input audio sequence is a
challenging problem that receives considerable attentions recently. In this paper, we …