Transformers in vision: A survey

S Khan, M Naseer, M Hayat, SW Zamir… - ACM computing …, 2022 - dl.acm.org
Astounding results from Transformer models on natural language tasks have intrigued the
vision community to study their application to computer vision problems. Among their salient …

Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …

Executing your commands via motion diffusion in latent space

X Chen, B Jiang, W Liu, Z Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We study a challenging task, conditional human motion generation, which produces
plausible human motion sequences according to various conditional inputs, such as action …

Mvimgnet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …

Avatarclip: Zero-shot text-driven generation and animation of 3d avatars

F Hong, M Zhang, L Pan, Z Cai, L Yang… - arxiv preprint arxiv …, 2022 - arxiv.org
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …

Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion

MJ Black, P Patel, J Tesch… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …

Cliff: Carrying location information in full frames into human pose and shape estimation

Z Li, J Liu, Z Zhang, S Xu, Y Yan - European Conference on Computer …, 2022 - Springer
Top-down methods dominate the field of 3D human pose and shape estimation, because
they are decoupled from human detection and allow researchers to focus on the core …

PARE: Part attention regressor for 3D human body estimation

M Kocabas, CHP Huang, O Hilliges… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite significant progress, we show that state of the art 3D human pose and shape
estimation methods remain sensitive to partial occlusion and can produce dramatically …

One-stage 3d whole-body mesh recovery with component aware transformer

J Lin, A Zeng, H Wang, L Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Whole-body mesh recovery aims to estimate the 3D human body, face, and hands
parameters from a single image. It is challenging to perform this task with a single network …

3D human pose estimation via intuitive physics

S Tripathi, L Müller, CHP Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Estimating 3D humans from images often produces implausible bodies that lean, float, or
penetrate the floor. Such methods ignore the fact that bodies are typically supported by the …