State of the art on diffusion models for visual computing
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
[HTML][HTML] Large language models for human-robot interaction: A review
The fusion of large language models and robotic systems has introduced a transformative
paradigm in human–robot interaction, offering unparalleled capabilities in natural language …
paradigm in human–robot interaction, offering unparalleled capabilities in natural language …
Pointllm: Empowering large language models to understand point clouds
The unprecedented advancements in Large Language Models (LLMs) have shown a
profound impact on natural language processing but are yet to fully embrace the realm of 3D …
profound impact on natural language processing but are yet to fully embrace the realm of 3D …
Motion mamba: Efficient and long sequence motion generation
Human motion generation stands as a significant pursuit in generative computer vision,
while achieving long-sequence and efficient motion generation remains challenging. Recent …
while achieving long-sequence and efficient motion generation remains challenging. Recent …
Motionlcm: Real-time controllable motion generation via latent consistency model
This work introduces MotionLCM, extending controllable motion generation to a real-time
level. Existing methods for spatial-temporal control in text-conditioned motion generation …
level. Existing methods for spatial-temporal control in text-conditioned motion generation …
Intergen: Diffusion-based multi-human motion generation under complex interactions
We have recently seen tremendous progress in diffusion advances for generating realistic
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …
human motions. Yet, they largely disregard the multi-human interactions. In this paper, we …
Emdm: Efficient motion diffusion model for fast and high-quality motion generation
Abstract We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality
human motion generation. Current state-of-the-art generative diffusion models have …
human motion generation. Current state-of-the-art generative diffusion models have …
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and Planning
Abstract Recent progress in Large Multimodal Models (LMM) has opened up great
possibilities for various applications in the field of human-machine interactions. However …
possibilities for various applications in the field of human-machine interactions. However …
[HTML][HTML] Multimodal large language models in health care: applications, challenges, and future outlook
In the complex and multidimensional field of medicine, multimodal data are prevalent and
crucial for informed clinical decisions. Multimodal data span a broad spectrum of data types …
crucial for informed clinical decisions. Multimodal data span a broad spectrum of data types …
Large motion model for unified multi-modal motion generation
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …