Large motion model for unified multi-modal motion generation
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
In this work, we present Semantic Gesticulator, a novel framework designed to synthesize
realistic gestures accompanying speech with strong semantic correspondence. Semantically …
realistic gestures accompanying speech with strong semantic correspondence. Semantically …
Lodge: A coarse to fine diffusion network for long dance generation guided by the characteristic dance primitives
We propose Lodge a network capable of generating extremely long dance sequences
conditioned on given music. We design Lodge as a two-stage coarse to fine diffusion …
conditioned on given music. We design Lodge as a two-stage coarse to fine diffusion …
Mambatalk: Efficient holistic gesture synthesis with selective state space models
Gesture synthesis is a vital realm of human-computer interaction, with wide-ranging
applications across various fields like film, robotics, and virtual reality. Recent advancements …
applications across various fields like film, robotics, and virtual reality. Recent advancements …
Freetalker: Controllable speech and text-driven gesture generation based on diffusion models for enhanced speaker naturalness
Current talking avatars mostly generate co-speech gestures based on audio and text of the
utterance, without considering the non-speaking motion of the speaker. Furthermore …
utterance, without considering the non-speaking motion of the speaker. Furthermore …
Synergistic Attention-Guided Cascaded Graph Diffusion Model for Complementarity Determining Region Synthesis
R Zhang, Y Huang, Y Lou, W Ding… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Complementarity determining region (CDR) is a specific region in antibody molecules that
binds to antigens, where a small portion of residues undergoes particularly pronounced …
binds to antigens, where a small portion of residues undergoes particularly pronounced …
Learning Co-Speech Gesture Representations in Dialogue through Contrastive Learning: An Intrinsic Evaluation
In face-to-face dialogues, the form-meaning relationship of co-speech gestures varies
depending on contextual factors such as what the gestures refer to and the individual …
depending on contextual factors such as what the gestures refer to and the individual …
SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis
A good co-speech motion generation cannot be achieved without a careful integration of
common rhythmic motion and rare yet essential semantic motion. In this work, we propose …
common rhythmic motion and rare yet essential semantic motion. In this work, we propose …
Alignment is All You Need: A Training-free Augmentation Strategy for Pose-guided Video Generation
X **, Z Xu, M Ou, W Yang - arxiv preprint arxiv:2408.16506, 2024 - arxiv.org
Character animation is a transformative field in computer graphics and vision, enabling
dynamic and realistic video animations from static images. Despite advancements …
dynamic and realistic video animations from static images. Despite advancements …