Recovering 3d human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

Econ: Explicit clothed humans optimized via normal integration

Y **u, J Yang, X Cao, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com
The combination of deep learning, artist-curated scans, and Implicit Functions (IF), is
enabling the creation of detailed, clothed, 3D humans from images. However, existing …

Effective whole-body pose estimation with two-stages distillation

Z Yang, A Zeng, C Yuan, Y Li - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Whole-body pose estimation localizes the human body, hand, face, and foot keypoints in an
image. This task is challenging due to multi-scale body parts, fine-grained localization for …

Smpler-x: Scaling up expressive human pose and shape estimation

Z Cai, W Yin, A Zeng, C Wei, Q Sun… - Advances in …, 2024 - proceedings.neurips.cc
Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion
capture with numerous applications. Despite encouraging progress, current state-of-the-art …

ARCTIC: A dataset for dexterous bimanual hand-object manipulation

Z Fan, O Taheri, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans intuitively understand that inanimate objects do not move by themselves, but that
state changes are typically caused by human manipulation (eg, the opening of a book). This …

TRACE: 5D temporal regression of avatars with dynamic cameras in 3D environments

Y Sun, Q Bao, W Liu, T Mei… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Although the estimation of 3D human pose and shape (HPS) is rapidly progressing, current
methods still cannot reliably estimate moving humans in global coordinates, which is critical …

Depth pro: Sharp monocular metric depth in less than a second

A Bochkovskii, A Delaunoy, H Germain… - arxiv preprint arxiv …, 2024 - arxiv.org
We present a foundation model for zero-shot metric monocular depth estimation. Our model,
Depth Pro, synthesizes high-resolution depth maps with unparalleled sharpness and high …

Refit: Recurrent fitting network for 3d human recovery

Y Wang, K Daniilidis - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Abstract We present Recurrent Fitting (ReFit), a neural network architecture for single-image,
parametric 3D human reconstruction. ReFit learns a feedback-update loop that mirrors the …

Zolly: Zoom focal length correctly for perspective-distorted human mesh reconstruction

W Wang, Y Ge, H Mei, Z Cai, Q Sun… - Proceedings of the …, 2023 - openaccess.thecvf.com
As it is hard to calibrate single-view RGB images in the wild, existing 3D human mesh
reconstruction (3DHMR) methods either use a constant large focal length or estimate one …

360-degree Human Video Generation with 4D Diffusion Transformer

R Shao, Y Pang, Z Zheng, J Sun, Y Liu - ACM Transactions on Graphics …, 2024 - dl.acm.org
We present a novel approach for generating 360-degree high-quality, spatiotemporally
coherent human videos from a single image. Our framework combines the strengths of …