Recovering 3d human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
Motion-x: A large-scale 3d expressive whole-body human motion dataset
In this paper, we present Motion-X, a large-scale 3D expressive whole-body motion dataset.
Existing motion datasets predominantly contain body-only poses, lacking facial expressions …
Existing motion datasets predominantly contain body-only poses, lacking facial expressions …
Effective whole-body pose estimation with two-stages distillation
Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …
image. This task is challenging due to multi-scale body parts, fine-grained localization for …
Smpler-x: Scaling up expressive human pose and shape estimation
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion
capture with numerous applications. Despite encouraging progress, current state-of-the-art …
capture with numerous applications. Despite encouraging progress, current state-of-the-art …
Grounded sam: Assembling open-world models for diverse visual tasks
We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to
combine with the segment anything model (SAM). This integration enables the detection and …
combine with the segment anything model (SAM). This integration enables the detection and …
Chatpose: Chatting about 3d human pose
We introduce ChatPose a framework employing Large Language Models (LLMs) to
understand and reason about 3D human poses from images or textual descriptions. Our …
understand and reason about 3D human poses from images or textual descriptions. Our …
Digital life project: Autonomous 3d characters with social intelligence
In this work we present Digital Life Project a framework utilizing language as the universal
medium to build autonomous 3D characters who are capable of engaging in social …
medium to build autonomous 3D characters who are capable of engaging in social …
Wham: Reconstructing world-grounded humans with accurate 3d motion
The estimation of 3D human motion from video has progressed rapidly but current methods
still have several key limitations. First most methods estimate the human in camera …
still have several key limitations. First most methods estimate the human in camera …
Expressive whole-body 3D gaussian avatar
Facial expression and hand motions are necessary to express our emotions and interact
with the world. Nevertheless, most of the 3D human avatars modeled from a casually …
with the world. Nevertheless, most of the 3D human avatars modeled from a casually …
A single 2d pose with context is worth hundreds for 3d human pose estimation
The dominant paradigm in 3D human pose estimation that lifts a 2D pose sequence to 3D
heavily relies on long-term temporal clues (ie, using a daunting number of video frames) for …
heavily relies on long-term temporal clues (ie, using a daunting number of video frames) for …