Transformers in vision: A survey
Astounding results from Transformer models on natural language tasks have intrigued the
vision community to study their application to computer vision problems. Among their salient …
vision community to study their application to computer vision problems. Among their salient …
Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
Motiondiffuse: Text-driven human motion generation with diffusion model
Human motion modeling is important for many modern graphics applications, which typically
require professional skills. In order to remove the skill barriers for laymen, recent motion …
require professional skills. In order to remove the skill barriers for laymen, recent motion …
Humans in 4D: Reconstructing and tracking humans with transformers
We present an approach to reconstruct humans and track them over time. At the core of our
approach, we propose a fully" transformerized" version of a network for human mesh …
approach, we propose a fully" transformerized" version of a network for human mesh …
Generating diverse and natural 3d human motions from text
Automated generation of 3D human motions from text is a challenging problem. The
generated motions are expected to be sufficiently diverse to explore the text-grounded …
generated motions are expected to be sufficiently diverse to explore the text-grounded …
Stablerep: Synthetic images from text-to-image models make strong visual representation learners
We investigate the potential of learning visual representations using synthetic images
generated by text-to-image models. This is a natural question in the light of the excellent …
generated by text-to-image models. This is a natural question in the light of the excellent …
Avatarclip: Zero-shot text-driven generation and animation of 3d avatars
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
TEMOS: Generating Diverse Human Motions from Textual Descriptions
We address the problem of generating diverse 3D human motions from textual descriptions.
This challenging task requires joint modeling of both modalities: understanding and …
This challenging task requires joint modeling of both modalities: understanding and …
Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
Humanrf: High-fidelity neural radiance fields for humans in motion
Representing human performance at high-fidelity is an essential building block in diverse
applications, such as film production, computer games or videoconferencing. To close the …
applications, such as film production, computer games or videoconferencing. To close the …