Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Avatarclip: Zero-shot text-driven generation and animation of 3d avatars

F Hong, M Zhang, L Pan, Z Cai, L Yang… - arxiv preprint arxiv …, 2022 - arxiv.org
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …

Collaborative diffusion for multi-modal face generation and editing

Z Huang, KCK Chan, Y Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Diffusion models arise as a powerful generative tool recently. Despite the great progress,
existing diffusion models mainly focus on uni-modal control, ie, the diffusion process is …

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

End-to-end reconstruction-classification learning for face forgery detection

J Cao, C Ma, T Yao, S Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Existing face forgery detectors mainly focus on specific forgery patterns like noise
characteristics, local textures, or frequency statistics for forgery detection. This causes …

Text2human: Text-driven controllable human image generation

Y Jiang, S Yang, H Qiu, W Wu, CC Loy… - ACM Transactions on …, 2022 - dl.acm.org
Generating high-quality and diverse human images is an important yet challenging task in
vision and graphics. However, existing generative models often fall short under the high …

Stylegan-human: A data-centric odyssey of human generation

J Fu, S Li, Y Jiang, KY Lin, C Qian, CC Loy… - … on Computer Vision, 2022 - Springer
Unconditional human image generation is an important task in vision and graphics, enabling
various applications in the creative industry. Existing studies in this field mainly focus on …

Generative recommendation: Towards next-generation recommender paradigm

W Wang, X Lin, F Feng, X He, TS Chua - arxiv preprint arxiv:2304.03516, 2023 - arxiv.org
Recommender systems typically retrieve items from an item corpus for personalized
recommendations. However, such a retrieval-based recommender paradigm faces two …

CelebV-HQ: A large-scale video facial attributes dataset

H Zhu, W Wu, W Zhu, L Jiang, S Tang, L Zhang… - European conference on …, 2022 - Springer
Large-scale datasets have played indispensable roles in the recent success of face
generation/editing and significantly facilitated the advances of emerging research fields …

Villandiffusion: A unified backdoor attack framework for diffusion models

SY Chou, PY Chen, TY Ho - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Diffusion Models (DMs) are state-of-the-art generative models that learn a
reversible corruption process from iterative noise addition and denoising. They are the …