Recovering 3d human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
A survey on graph neural networks and graph transformers in computer vision: A task-oriented perspective
Graph Neural Networks (GNNs) have gained momentum in graph representation learning
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …
and boosted the state of the art in a variety of areas, such as data mining (eg, social network …
Pymaf-x: Towards well-aligned full-body model regression from monocular images
We present PyMAF-X, a regression-based approach to recovering a parametric full-body
model from a single image. This task is very challenging since minor parametric deviation …
model from a single image. This task is very challenging since minor parametric deviation …
ARCTIC: A dataset for dexterous bimanual hand-object manipulation
Humans intuitively understand that inanimate objects do not move by themselves, but that
state changes are typically caused by human manipulation (eg, the opening of a book). This …
state changes are typically caused by human manipulation (eg, the opening of a book). This …
Reconstructing hands in 3d with transformers
We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …
A dataset of relighted 3D interacting hands
The two-hand interaction is one of the most challenging signals to analyze due to the self-
similarity, complicated articulations, and occlusions of hands. Although several datasets …
similarity, complicated articulations, and occlusions of hands. Although several datasets …
Hybrik-x: Hybrid analytical-neural inverse kinematics for whole-body mesh recovery
Recovering whole-body mesh by inferring the abstract pose and shape parameters from
visual content can obtain 3D bodies with realistic structures. However, the inferring process …
visual content can obtain 3D bodies with realistic structures. However, the inferring process …
gsdf: Geometry-driven signed distance functions for 3d hand-object reconstruction
Signed distance functions (SDFs) is an attractive framework that has recently shown
promising results for 3D shape reconstruction from images. SDFs seamlessly generalize to …
promising results for 3D shape reconstruction from images. SDFs seamlessly generalize to …
Taco: Benchmarking generalizable bimanual tool-action-object understanding
Humans commonly work with multiple objects in daily life and can intuitively transfer
manipulation skills to novel objects by understanding object functional regularities. However …
manipulation skills to novel objects by understanding object functional regularities. However …
Reconstructing interacting hands with interaction prior from monocular images
Reconstructing interacting hands from monocular images is indispensable in AR/VR
applications. Most existing solutions rely on the accurate localization of each skeleton joint …
applications. Most existing solutions rely on the accurate localization of each skeleton joint …