Transformers in vision: A survey
Astounding results from Transformer models on natural language tasks have intrigued the
vision community to study their application to computer vision problems. Among their salient …
Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (e.g., body skeleton) from input data such as images and videos. It has drawn …
Executing your commands via motion diffusion in latent space
We study a challenging task, conditional human motion generation, which produces
plausible human motion sequences according to various conditional inputs, such as action …
MVImgNet: A large-scale dataset of multi-view images
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of "learning from large-scale data" in computer vision …
AvatarCLIP: Zero-shot text-driven generation and animation of 3D avatars
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
BEDLAM: A synthetic dataset of bodies exhibiting detailed lifelike animated motion
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
CLIFF: Carrying location information in full frames into human pose and shape estimation
Top-down methods dominate the field of 3D human pose and shape estimation, because
they are decoupled from human detection and allow researchers to focus on the core …
PARE: Part attention regressor for 3D human body estimation
Despite significant progress, we show that state of the art 3D human pose and shape
estimation methods remain sensitive to partial occlusion and can produce dramatically …
One-stage 3D whole-body mesh recovery with component aware transformer
Whole-body mesh recovery aims to estimate the 3D human body, face, and hands
parameters from a single image. It is challenging to perform this task with a single network …
3D human pose estimation via intuitive physics
Estimating 3D humans from images often produces implausible bodies that lean, float, or
penetrate the floor. Such methods ignore the fact that bodies are typically supported by the …