Human motion generation: A survey
Human motion generation aims to generate natural human pose sequences and shows
immense potential for real-world applications. Substantial progress has been made recently …
immense potential for real-world applications. Substantial progress has been made recently …
Humangaussian: Text-driven 3d human generation with gaussian splatting
Realistic 3D human generation from text prompts is a desirable yet challenging task.
Existing methods optimize 3D representations like mesh or neural fields via score distillation …
Existing methods optimize 3D representations like mesh or neural fields via score distillation …
Taming diffusion models for audio-driven co-speech gesture generation
Animating virtual avatars to make co-speech gestures facilitates various applications in
human-machine interaction. The existing methods mainly rely on generative adversarial …
human-machine interaction. The existing methods mainly rely on generative adversarial …
Gesturediffuclip: Gesture diffusion model with clip latents
T Ao, Z Zhang, L Liu - ACM Transactions on Graphics (TOG), 2023 - dl.acm.org
The automatic generation of stylized co-speech gestures has recently received increasing
attention. Previous systems typically allow style control via predefined text labels or example …
attention. Previous systems typically allow style control via predefined text labels or example …
Generating holistic 3d human motion from speech
This work addresses the problem of generating 3D holistic body motions from human
speech. Given a speech recording, we synthesize sequences of 3D body poses, hand …
speech. Given a speech recording, we synthesize sequences of 3D body poses, hand …
Expressive talking head generation with granular audio-visual control
Generating expressive talking heads is essential for creating virtual humans. However,
existing one-or few-shot methods focus on lip-sync and head motion, ignoring the emotional …
existing one-or few-shot methods focus on lip-sync and head motion, ignoring the emotional …
Rhythmic gesticulator: Rhythm-aware co-speech gesture synthesis with hierarchical neural embeddings
Automatic synthesis of realistic co-speech gestures is an increasingly important yet
challenging task in artificial embodied agent creation. Previous systems mainly focus on …
challenging task in artificial embodied agent creation. Previous systems mainly focus on …
Semantic-aware implicit neural audio-driven video portrait generation
Animating high-fidelity video portrait with speech audio is crucial for virtual reality and digital
entertainment. While most previous studies rely on accurate explicit structural information …
entertainment. While most previous studies rely on accurate explicit structural information …
Livelyspeaker: Towards semantic-aware co-speech gesture generation
Gestures are non-verbal but important behaviors accompanying people's speech. While
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …
Large motion model for unified multi-modal motion generation
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …