Multimodal image synthesis and editing: A survey and taxonomy
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …
among multimodal information plays a key role for the creation and perception of multimodal …
Avatarclip: Zero-shot text-driven generation and animation of 3d avatars
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
Collaborative diffusion for multi-modal face generation and editing
Diffusion models arise as a powerful generative tool recently. Despite the great progress,
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …
Vbench: Comprehensive benchmark suite for video generative models
Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …
remains a challenge. A comprehensive evaluation benchmark for video generation is …
End-to-end reconstruction-classification learning for face forgery detection
Existing face forgery detectors mainly focus on specific forgery patterns like noise
characteristics, local textures, or frequency statistics for forgery detection. This causes …
characteristics, local textures, or frequency statistics for forgery detection. This causes …
Text2human: Text-driven controllable human image generation
Generating high-quality and diverse human images is an important yet challenging task in
vision and graphics. However, existing generative models often fall short under the high …
vision and graphics. However, existing generative models often fall short under the high …
Stylegan-human: A data-centric odyssey of human generation
Unconditional human image generation is an important task in vision and graphics, enabling
various applications in the creative industry. Existing studies in this field mainly focus on …
various applications in the creative industry. Existing studies in this field mainly focus on …
Generative recommendation: Towards next-generation recommender paradigm
Recommender systems typically retrieve items from an item corpus for personalized
recommendations. However, such a retrieval-based recommender paradigm faces two …
recommendations. However, such a retrieval-based recommender paradigm faces two …
CelebV-HQ: A large-scale video facial attributes dataset
Large-scale datasets have played indispensable roles in the recent success of face
generation/editing and significantly facilitated the advances of emerging research fields …
generation/editing and significantly facilitated the advances of emerging research fields …
Villandiffusion: A unified backdoor attack framework for diffusion models
Abstract Diffusion Models (DMs) are state-of-the-art generative models that learn a
reversible corruption process from iterative noise addition and denoising. They are the …
reversible corruption process from iterative noise addition and denoising. They are the …