Emdm: Efficient motion diffusion model for fast and high-quality motion generation

W Zhou, Z Dou, Z Cao, Z Liao, J Wang, W Wang… - … on Computer Vision, 2024 - Springer
Abstract We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality
human motion generation. Current state-of-the-art generative diffusion models have …

Beyond talking–generating holistic 3d human dyadic motion for communication

M Sun, C Xu, X Jiang, Y Liu, B Sun… - International Journal of …, 2024 - Springer
In this paper, we introduce an innovative task focused on human communication, aiming to
generate 3D holistic human motions for both speakers and listeners. Central to our …

UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model

X Fan, J Li, Z Lin, W **ao, L Yang - European Conference on Computer …, 2024 - Springer
Audio-driven 3D facial animation aims to map input audio to realistic facial motion. Despite
significant progress, limitations arise from inconsistent 3D annotations, restricting previous …

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Y Li, R Hou, H Chang, S Shan, X Chen - arxiv preprint arxiv:2411.16781, 2024 - arxiv.org
Human pose plays a crucial role in the digital age. While recent works have achieved
impressive progress in understanding and generating human poses, they often support only …

Leveraging active perception for real-time high-resolution pose estimation

T Manousis, E Eleftheriadis, N Passalis… - Expert Systems with …, 2025 - Elsevier
The evolution of computational intelligence, especially deep learning, has revolutionized
problem-solving approaches, with human pose estimation emerging as a popular challenge …

SATPose: Improving monocular 3D pose estimation with spatial-aware ground tactility

L Zhan, E Ying, J Gan, S Guo, BY Gao… - Proceedings of the 32nd …, 2024 - dl.acm.org
Estimating 3D human poses from monocular images is an important research area with
many practical applications. However, the depth ambiguity of 2D solutions limits their …

SAT-HMR: Real-Time Multi-Person 3D Mesh Estimation via Scale-Adaptive Tokens

C Su, X Ma, J Su, Y Wang - arxiv preprint arxiv:2411.19824, 2024 - arxiv.org
We propose a one-stage framework for real-time multi-person 3D human mesh estimation
from a single RGB image. While current one-stage methods, which follow a DETR-style …

BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation

S Wang, J Li, T Li, Y Yuan, H Fuchs, K Nagano… - arxiv preprint arxiv …, 2024 - arxiv.org
Single-image human mesh recovery is a challenging task due to the ill-posed nature of
simultaneous body shape, pose, and camera estimation. Existing estimators work well on …

SIMS: Simulating Human-Scene Interactions with Real World Script Planning

W Wang, L Pan, Z Dou, Z Liao, Y Lou, L Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Simulating long-term human-scene interaction is a challenging yet fascinating task. Previous
works have not effectively addressed the generation of long-term human scene interactions …

Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation

Z Gu, ZQ Zhao, H Ding, H Shen, Z Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
In practical applications of human pose estimation, low-resolution inputs frequently occur,
and existing state-of-the-art models perform poorly with low-resolution images. This work …