Follow-your-emoji: Fine-controllable and expressive freestyle portrait animation

Y Ma, H Liu, H Wang, H Pan, Y He, J Yuan… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We present Follow-Your-Emoji, a diffusion-based framework for portrait animation, which
animates a reference portrait with target landmark sequences. The main challenge of portrait …

Efficient diffusion models: A comprehensive survey from principles to practices

Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma… - arxiv preprint arxiv …, 2024 - arxiv.org
As one of the most popular and sought-after generative models in the recent years, diffusion
models have sparked the interests of many researchers and steadily shown excellent …

MegActor-Σ: Unlocking Flexible Mixed-Modal Control in Portrait Animation with Diffusion Transformer

S Yang, H Li, J Wu, M Jing, L Li, R Ji, J Liang… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have demonstrated superior performance in the field of portrait animation.
However, current approaches relied on either visual or audio modality to control character …

Videomaker: Zero-shot customized video generation with the inherent force of video diffusion models

T Wu, Y Zhang, X Cun, Z Qi, J Pu, H Dou… - arxiv preprint arxiv …, 2024 - arxiv.org
Zero-shot customized video generation has gained significant attention due to its substantial
application potential. Existing methods rely on additional models to extract and inject …

Diffsim: Taming diffusion models for evaluating visual similarity

Y Song, X Liu, MZ Shou - arxiv preprint arxiv:2412.14580, 2024 - arxiv.org
Diffusion models have fundamentally transformed the field of generative models, making the
assessment of similarity between customized model outputs and reference inputs critically …

GRID: Visual layout generation

C Wan, X Luo, Z Cai, Y Song, Y Zhao, Y Bai… - arxiv preprint arxiv …, 2024 - arxiv.org
In this paper, we introduce GRID, a novel paradigm that reframes a broad range of visual
generation tasks as the problem of arranging grids, akin to film strips. At its core, GRID …

X-Dyna: Expressive Dynamic Human Image Animation

D Chang, H Xu, Y Xie, Y Gao, Z Kuang, S Cai… - arxiv preprint arxiv …, 2025 - arxiv.org
We introduce X-Dyna, a novel zero-shot, diffusion-based pipeline for animating a single
human image using facial expressions and body movements derived from a driving video …

Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation

Y Song, S Lou, X Liu, H Ci, P Yang, J Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have revolutionized generative modeling with their exceptional ability to
produce high-fidelity images. However, misuse of such potent tools can lead to the creation …

Human motion video generation: A survey

H Xue, X Luo, Z Hu, X Zhang, X Xiang, Y Dai, J Liu… - Authorea …, 2024 - techrxiv.org
Human motion video generation has garnered significant research interest due to its broad
applications, enabling innovations such as photorealistic singing heads or dynamic avatars …

Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion

H Wei, W Han, X Dong, J Shen - arxiv preprint arxiv:2411.10369, 2024 - arxiv.org
Recent diffusion-based Single-image 3D portrait generation methods typically employ 2D
diffusion models to provide multi-view knowledge, which is then distilled into 3D …