State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

[HTML][HTML] Deep 3D human pose estimation: A review

J Wang, S Tan, X Zhen, S Xu, F Zheng, Z He… - Computer Vision and …, 2021 - Elsevier
Abstract Three-dimensional (3D) human pose estimation involves estimating the articulated
3D joint locations of a human body from an image or video. Due to its widespread …

Humans in 4D: Reconstructing and tracking humans with transformers

S Goel, G Pavlakos, J Rajasegaran… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present an approach to reconstruct humans and track them over time. At the core of our
approach, we propose a fully" transformerized" version of a network for human mesh …

Motiondiffuse: Text-driven human motion generation with diffusion model

M Zhang, Z Cai, L Pan, F Hong, X Guo… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Human motion modeling is important for many modern graphics applications, which typically
require professional skills. In order to remove the skill barriers for laymen, recent motion …

A dual-augmentor framework for domain generalization in 3d human pose estimation

Q Peng, C Zheng, C Chen - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Abstract 3D human pose data collected in controlled laboratory settings present challenges
for pose estimators that generalize across diverse scenarios. To address this domain …

Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion

MJ Black, P Patel, J Tesch… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …

Poseformerv2: Exploring frequency domain for efficient and robust 3d human pose estimation

Q Zhao, C Zheng, M Liu, P Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, transformer-based methods have gained significant success in sequential 2D-to-
3D lifting human pose estimation. As a pioneering work, PoseFormer captures spatial …

Motionbert: A unified perspective on learning human motion representations

W Zhu, X Ma, Z Liu, L Liu, W Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a unified perspective on tackling various human-centric video tasks by learning
human motion representations from large-scale and heterogeneous data resources …

3d human pose estimation with spatio-temporal criss-cross attention

Z Tang, Z Qiu, Y Hao, R Hong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recent transformer-based solutions have shown great success in 3D human pose
estimation. Nevertheless, to calculate the joint-to-joint affinity matrix, the computational cost …

Humanrf: High-fidelity neural radiance fields for humans in motion

M Işık, M Rünz, M Georgopoulos, T Khakhulin… - ACM Transactions on …, 2023 - dl.acm.org
Representing human performance at high-fidelity is an essential building block in diverse
applications, such as film production, computer games or videoconferencing. To close the …