Smoodi: Stylized motion diffusion model
We introduce a novel Stylized Motion Diffusion model, dubbed SMooDi, to generate stylized
motion driven by content texts and style motion sequences. Unlike existing methods that …
motion driven by content texts and style motion sequences. Unlike existing methods that …
Ditto: Diffusion inference-time t-optimization for music generation
We propose Diffusion Inference-Time T-Optimization (DITTO), a general-purpose frame-
work for controlling pre-trained text-to-music diffusion models at inference-time via …
work for controlling pre-trained text-to-music diffusion models at inference-time via …
Iterative motion editing with natural language
Text-to-motion diffusion models can generate realistic animations from text prompts, but do
not support fine-grained motion editing controls. In this paper, we present a method for using …
not support fine-grained motion editing controls. In this paper, we present a method for using …
Inference-time scaling for diffusion models beyond scaling denoising steps
Generative models have made significant impacts across various domains, largely due to
their ability to scale during training by increasing data, computational resources, and model …
their ability to scale during training by increasing data, computational resources, and model …
RoHM: Robust Human Motion Reconstruction via Diffusion
We propose RoHM an approach for robust 3D human motion reconstruction from monocular
RGB (-D) videos in the presence of noise and occlusions. Most previous approaches either …
RGB (-D) videos in the presence of noise and occlusions. Most previous approaches either …
COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation
Estimating global human motion from moving cameras is challenging due to the
entanglement of human and camera motions. To mitigate the ambiguity, existing methods …
entanglement of human and camera motions. To mitigate the ambiguity, existing methods …
CLoSD: Closing the Loop between Simulation and Diffusion for multi-task character control
Motion diffusion models and Reinforcement Learning (RL) based control for physics-based
simulations have complementary strengths for human motion generation. The former is …
simulations have complementary strengths for human motion generation. The former is …
Dart: A diffusion-based autoregressive motion model for real-time text-driven motion control
Text-conditioned human motion generation, which allows for user interaction through natural
language, has become increasingly popular. Existing methods typically generate short …
language, has become increasingly popular. Existing methods typically generate short …
CPoser: An Optimization-after-Parsing Approach for Text-to-Pose Generation Using Large Language Models
Text-to-pose generation is challenging due to the complexity of natural language and
human posture semantics. Utilizing large language models (LLMs) for text-to-pose …
human posture semantics. Utilizing large language models (LLMs) for text-to-pose …
SKEL-Betweener: a Neural Motion Rig for Interactive Motion Authoring
Authoring 3D motions is a laborious process that requires manipulating and coordinating
many control handles over time. Neural motion representations learned from large motion …
many control handles over time. Neural motion representations learned from large motion …