Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
[HTML][HTML] Deep holography
G Situ - Light: Advanced Manufacturing, 2022 - light-am.com
With the explosive growth of mathematical optimization and computing hardware, deep
neural networks (DNN) have become tremendously powerful tools to solve many …
neural networks (DNN) have become tremendously powerful tools to solve many …
Generative diffusion prior for unified image restoration and enhancement
Existing image restoration methods mostly leverage the posterior distribution of natural
images. However, they often assume known degradation and also require supervised …
images. However, they often assume known degradation and also require supervised …
Collaborative diffusion for multi-modal face generation and editing
Diffusion models arise as a powerful generative tool recently. Despite the great progress,
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …
Hierarchical fine-grained image forgery detection and localization
Differences in forgery attributes of images generated in CNN-synthesized and image-editing
domains are large, and such differences make a unified image forgery detection and …
domains are large, and such differences make a unified image forgery detection and …
Headnerf: A real-time nerf-based parametric head model
In this paper, we propose HeadNeRF, a novel NeRF-based parametric head model that
integrates the neural radiance field to the parametric representation of the human head. It …
integrates the neural radiance field to the parametric representation of the human head. It …
Gan inversion: A survey
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …
model so that the image can be faithfully reconstructed from the inverted code by the …
Diffusion models already have a semantic latent space
Diffusion models achieve outstanding generative performance in various domains. Despite
their great success, they lack semantic latent space which is essential for controlling the …
their great success, they lack semantic latent space which is essential for controlling the …
Stylespace analysis: Disentangled controls for stylegan image generation
We explore and analyze the latent style space of StyleGAN2, a state-of-the-art architecture
for image generation, using models pretrained on several different datasets. We first show …
for image generation, using models pretrained on several different datasets. We first show …
Ad-nerf: Audio driven neural radiance fields for talking head synthesis
Generating high-fidelity talking head video by fitting with the input audio sequence is a
challenging problem that receives considerable attentions recently. In this paper, we …
challenging problem that receives considerable attentions recently. In this paper, we …