Real-world robot applications of foundation models: A review
Recent developments in foundation models, like Large Language Models (LLMs) and Vision-
Language Models (VLMs), trained on extensive data, facilitate flexible application across …
Human motion generation: A survey
Human motion generation aims to generate natural human pose sequences and shows
immense potential for real-world applications. Substantial progress has been made recently …
MotionGPT: Human motion as a foreign language
Though the advancement of pre-trained large language models unfolds, the exploration of
building a unified model for language and other multimodal data, such as motion, remains …
Motion-X: A large-scale 3D expressive whole-body human motion dataset
In this paper, we present Motion-X, a large-scale 3D expressive whole-body motion dataset.
Existing motion datasets predominantly contain body-only poses, lacking facial expressions …
InterDiff: Generating 3D human-object interactions with physics-informed diffusion
This paper addresses a novel task of anticipating 3D human-object interactions (HOIs). Most
existing research on HOI synthesis lacks comprehensive whole-body interactions with …
ReMoDiffuse: Retrieval-augmented motion diffusion model
3D human motion generation is crucial for creative industry. Recent advances rely
on generative models with domain knowledge for text-driven motion generation, leading to …
Michelangelo: Conditional 3D shape generation based on shape-image-text aligned latent representation
We present a novel alignment-before-generation approach to tackle the challenging task of
generating general 3D shapes based on 2D images or texts. Directly learning a conditional …
MotionLCM: Real-time controllable motion generation via latent consistency model
This work introduces MotionLCM, extending controllable motion generation to a real-time
level. Existing methods for spatial-temporal control in text-conditioned motion generation …
InterGen: Diffusion-based multi-human motion generation under complex interactions
We have recently seen tremendous progress in diffusion advances for generating realistic
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …
EMDM: Efficient motion diffusion model for fast and high-quality motion generation
We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality
human motion generation. Current state-of-the-art generative diffusion models have …