Recovering 3D human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

Motion-X: A large-scale 3D expressive whole-body human motion dataset

J Lin, A Zeng, S Lu, Y Cai, R Zhang… - Advances in Neural …, 2023 - proceedings.neurips.cc
In this paper, we present Motion-X, a large-scale 3D expressive whole-body motion dataset.
Existing motion datasets predominantly contain body-only poses, lacking facial expressions …

Effective whole-body pose estimation with two-stages distillation

Z Yang, A Zeng, C Yuan, Y Li - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …

SMPLer-X: Scaling up expressive human pose and shape estimation

Z Cai, W Yin, A Zeng, C Wei, Q Sun… - Advances in …, 2023 - proceedings.neurips.cc
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion
capture with numerous applications. Despite encouraging progress, current state-of-the-art …

Grounded SAM: Assembling open-world models for diverse visual tasks

T Ren, S Liu, A Zeng, J Lin, K Li, H Cao, J Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Grounded SAM, which uses Grounding DINO as an open-set object detector to
combine with the segment anything model (SAM). This integration enables the detection and …

ChatPose: Chatting about 3D human pose

Y Feng, J Lin, SK Dwivedi, Y Sun… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce ChatPose, a framework employing Large Language Models (LLMs) to
understand and reason about 3D human poses from images or textual descriptions. Our …

Digital Life Project: Autonomous 3D characters with social intelligence

Z Cai, J Jiang, Z Qing, X Guo… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this work, we present Digital Life Project, a framework utilizing language as the universal
medium to build autonomous 3D characters who are capable of engaging in social …

WHAM: Reconstructing world-grounded humans with accurate 3D motion

S Shin, J Kim, E Halilaj… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The estimation of 3D human motion from video has progressed rapidly, but current methods
still have several key limitations. First, most methods estimate the human in camera …

Expressive whole-body 3D Gaussian avatar

G Moon, T Shiratori, S Saito - European Conference on Computer Vision, 2024 - Springer
Facial expression and hand motions are necessary to express our emotions and interact
with the world. Nevertheless, most of the 3D human avatars modeled from a casually …

A single 2D pose with context is worth hundreds for 3D human pose estimation

Q Zhao, C Zheng, M Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc
The dominant paradigm in 3D human pose estimation that lifts a 2D pose sequence to 3D
heavily relies on long-term temporal clues (i.e., using a daunting number of video frames) for …