Recovering 3d human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

Champ: Controllable and consistent human image animation with 3d parametric guidance

S Zhu, JL Chen, Z Dai, Z Dong, Y Xu, X Cao… - … on Computer Vision, 2024 - Springer
In this study, we introduce a methodology for human image animation by leveraging a 3D
human parametric model within a latent diffusion framework to enhance shape alignment …

Hugs: Human gaussian splats

M Kocabas, JHR Chang, J Gabriel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advances in neural rendering have improved both training and rendering times by
orders of magnitude. While these methods demonstrate state-of-the-art quality and speed …

Gart: Gaussian articulated template models

J Lei, Y Wang, G Pavlakos, L Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract We introduce Gaussian Articulated Template Model (GART) an explicit efficient and
expressive representation for non-rigid articulated subject capturing and rendering from …

Reconstructing hands in 3d with transformers

G Pavlakos, D Shan, I Radosavovic… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …

Chatpose: Chatting about 3d human pose

Y Feng, J Lin, SK Dwivedi, Y Sun… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce ChatPose a framework employing Large Language Models (LLMs) to
understand and reason about 3D human poses from images or textual descriptions. Our …

On the benefits of 3d pose and tracking for human action recognition

J Rajasegaran, G Pavlakos… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this work we study the benefits of using tracking and 3D poses for action recognition. To
achieve this, we take the Lagrangian view on analysing actions over a trajectory of human …

Paint-it: Text-to-texture synthesis via deep convolutional texture map optimization and physically-based rendering

K Youwang, TH Oh… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present Paint-it a text-driven high-fidelity texture map synthesis method for 3D meshes
via neural re-parameterized texture optimization. Paint-it synthesizes texture maps from a …

Hybrik-x: Hybrid analytical-neural inverse kinematics for whole-body mesh recovery

J Li, S Bian, C Xu, Z Chen, L Yang… - IEEE Transactions on …, 2025 - ieeexplore.ieee.org
Recovering whole-body mesh by inferring the abstract pose and shape parameters from
visual content can obtain 3D bodies with realistic structures. However, the inferring process …

Wham: Reconstructing world-grounded humans with accurate 3d motion

S Shin, J Kim, E Halilaj… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
The estimation of 3D human motion from video has progressed rapidly but current methods
still have several key limitations. First most methods estimate the human in camera …