Controllable human-object interaction synthesis

J Li, A Clegg, R Mottaghi, J Wu, X Puig… - European Conference on …, 2024 - Springer
Synthesizing semantic-aware, long-horizon, human-object interaction is critical to simulate
realistic human behaviors. In this work, we address the challenging problem of generating …

Large motion model for unified multi-modal motion generation

M Zhang, D Jin, C Gu, F Hong, Z Cai, J Huang… - … on Computer Vision, 2024 - Springer
Human motion generation, a cornerstone technique in animation and video production, has
widespread applications in various tasks like text-to-motion and music-to-dance. Previous …

DiffH2O: Diffusion-based synthesis of hand-object interactions from textual descriptions

S Christen, S Hampali, F Sener, E Remelli… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We introduce DiffH2O, a new diffusion-based framework for synthesizing realistic, dexterous
hand-object interactions from natural language. Our model employs a temporal two-stage …

InterFusion: Text-driven generation of 3D human-object interaction

S Dai, W Li, H Sun, H Huang, C Ma, H Huang… - … on Computer Vision, 2024 - Springer
In this study, we tackle the complex task of generating 3D human-object interactions (HOI)
from textual descriptions in a zero-shot text-to-3D manner. We identify and address two key …

F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

J Yang, X Niu, N Jiang, R Zhang, S Huang - European Conference on …, 2024 - Springer
Existing 3D human object interaction (HOI) datasets and models simply align global
descriptions with the long HOI sequence, while lacking a detailed understanding of …

DiffCAD: Weakly-supervised probabilistic CAD model retrieval and alignment from an RGB image

D Gao, D Rozenberszki, S Leutenegger… - ACM Transactions on …, 2024 - dl.acm.org
Perceiving 3D structures from RGB images based on CAD model primitives can enable an
effective, efficient 3D object-based representation of scenes. However, current approaches …

CORE4D: A 4D human-object-human interaction dataset for collaborative object rearrangement

C Zhang, Y Liu, R Xing, B Tang, L Yi - arXiv preprint arXiv:2406.19353, 2024 - arxiv.org
Understanding how humans cooperatively rearrange household objects is critical for VR/AR
and human-robot interaction. However, in-depth studies on modeling these behaviors are …

CrowdMoGen: Zero-shot text-driven collective motion generation

X Guo, M Zhang, H Xie, C Gu, Z Liu - arXiv preprint arXiv:2407.06188, 2024 - arxiv.org
Crowd Motion Generation is essential in entertainment industries such as animation and
games as well as in strategic fields like urban simulation and planning. This new task …

Thor: Text to human-object interaction diffusion via relation intervention

Q Wu, Y Shi, X Huang, J Yu, L Xu, J Wang - arXiv preprint arXiv …, 2024 - arxiv.org
This paper addresses new methodologies to deal with the challenging task of generating
dynamic Human-Object Interactions from textual descriptions (Text2HOI). While most …

Mimicking-bench: A benchmark for generalizable humanoid-scene interaction learning via human mimicking

Y Liu, B Yang, L Zhong, H Wang, L Yi - arXiv preprint arXiv:2412.17730, 2024 - arxiv.org
Learning generic skills for humanoid robots interacting with 3D scenes by mimicking human
data is a key research challenge with significant implications for robotics and real-world …