Recovering 3d human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
State of the Art in Dense Monocular Non‐Rigid 3D Reconstruction
Abstract 3D reconstruction of deformable (or non‐rigid) scenes from a set of monocular 2D
image observations is a long‐standing and actively researched area of computer vision and …
image observations is a long‐standing and actively researched area of computer vision and …
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
Abstract We present Ego-Exo4D a diverse large-scale multimodal multiview video dataset
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …
Reconstructing hands in 3d with transformers
We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …
Keypoint transformer: Solving joint identification in challenging hands and object interactions for accurate 3d pose estimation
We propose a robust and accurate method for estimating the 3D poses of two hands in close
interaction from a single color image. This is a very challenging problem, as large occlusions …
interaction from a single color image. This is a very challenging problem, as large occlusions …
Taco: Benchmarking generalizable bimanual tool-action-object understanding
Humans commonly work with multiple objects in daily life and can intuitively transfer
manipulation skills to novel objects by understanding object functional regularities. However …
manipulation skills to novel objects by understanding object functional regularities. However …
H2onet: Hand-occlusion-and-orientation-aware network for real-time 3d hand mesh reconstruction
Real-time 3D hand mesh reconstruction is challenging, especially when the hand is holding
some object. Beyond the previous methods, we design H2ONet to fully exploit non-occluded …
some object. Beyond the previous methods, we design H2ONet to fully exploit non-occluded …
Showme: Benchmarking object-agnostic hand-object 3d reconstruction
Recent hand-object interaction datasets show limited real object variability and rely on fitting
the MANO parametric model to obtain groundtruth hand shapes. To go beyond these …
the MANO parametric model to obtain groundtruth hand shapes. To go beyond these …
Hierarchical temporal transformer for 3d hand pose estimation and action recognition from egocentric rgb videos
Understanding dynamic hand motions and actions from egocentric RGB videos is a
fundamental yet challenging task due to self-occlusion and ambiguity. To address occlusion …
fundamental yet challenging task due to self-occlusion and ambiguity. To address occlusion …
Diverse 3d hand gesture prediction from body dynamics by bilateral hand disentanglement
Predicting natural and diverse 3D hand gestures from the upper body dynamics is a
practical yet challenging task in virtual avatar creation. Previous works usually overlook the …
practical yet challenging task in virtual avatar creation. Previous works usually overlook the …