Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
Recovering 3d human mesh from monocular images: A survey
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
computer vision. Since the release of statistical body models, 3D human mesh recovery has …
Humans in 4D: Reconstructing and tracking humans with transformers
We present an approach to reconstruct humans and track them over time. At the core of our
approach, we propose a fully" transformerized" version of a network for human mesh …
approach, we propose a fully" transformerized" version of a network for human mesh …
Cliff: Carrying location information in full frames into human pose and shape estimation
Top-down methods dominate the field of 3D human pose and shape estimation, because
they are decoupled from human detection and allow researchers to focus on the core …
they are decoupled from human detection and allow researchers to focus on the core …
Pure transformers are powerful graph learners
We show that standard Transformers without graph-specific modifications can lead to
promising results in graph learning both in theory and practice. Given a graph, we simply …
promising results in graph learning both in theory and practice. Given a graph, we simply …
Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
FastViT: A fast hybrid vision transformer using structural reparameterization
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …
Hybrik: A hybrid analytical-neural inverse kinematics solution for 3d human pose and shape estimation
Abstract Model-based 3D pose and shape estimation methods reconstruct a full 3D mesh for
the human body by estimating several parameters. However, learning the abstract …
the human body by estimating several parameters. However, learning the abstract …
Capturing and inferring dense full-body human-scene contact
CHP Huang, H Yi, M Höschle… - Proceedings of the …, 2022 - openaccess.thecvf.com
Inferring human-scene contact (HSC) is the first step toward understanding how humans
interact with their surroundings. While detecting 2D human-object interaction (HOI) and …
interact with their surroundings. While detecting 2D human-object interaction (HOI) and …
NIKI: Neural inverse kinematics with invertible neural networks for 3d human pose and shape estimation
With the progress of 3D human pose and shape estimation, state-of-the-art methods can
either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases …
either be robust to occlusions or obtain pixel-aligned accuracy in non-occlusion cases …