Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …

Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Bottom-up human pose estimation via disentangled keypoint regression

Z Geng, K Sun, B **ao, Z Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we are interested in the bottom-up paradigm of estimating human poses from
an image. We study the dense keypoint regression framework that is previously inferior to …

Fast fourier convolution

L Chi, B Jiang, Y Mu - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Vanilla convolutions in modern deep networks are known to operate locally and at fixed
scale (eg, the widely-adopted 3* 3 kernels in image-oriented tasks). This causes low efficacy …

Human pose as compositional tokens

Z Geng, C Wang, Y Wei, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human pose is typically represented by a coordinate vector of body joints or their heatmap
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …

Distribution-aware coordinate representation for human pose estimation

F Zhang, X Zhu, H Dai, M Ye… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
While being the de facto standard coordinate representation for human pose estimation,
heatmap has not been investigated in-depth. This work fills this gap. For the first time, we …

Ghum & ghuml: Generative 3d human shape and articulated pose models

H Xu, EG Bazavan, A Zanfir… - Proceedings of the …, 2020 - openaccess.thecvf.com
We present a statistical, articulated 3D human shape modeling pipeline, within a fully
trainable, modular, deep learning framework. Given high-resolution complete 3D body …

Learning normal dynamics in videos with meta prototype network

H Lv, C Chen, Z Cui, C Xu, Y Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
Frame reconstruction (current or future frames) based on Auto-Encoder (AE) is a popular
method for video anomaly detection. With models trained on the normal data, the …

Deep dual consecutive network for human pose estimation

Z Liu, H Chen, R Feng, S Wu, S Ji… - Proceedings of the …, 2021 - openaccess.thecvf.com
Multi-frame human pose estimation in complicated situations is challenging. Although state-
of-the-art human joints detectors have demonstrated remarkable results for static images …

Learning delicate local representations for multi-person pose estimation

Y Cai, Z Wang, Z Luo, B Yin, A Du, H Wang… - Computer Vision–ECCV …, 2020 - Springer
In this paper, we propose a novel method called Residual Steps Network (RSN). RSN
aggregates features with the same spatial size (Intra-level features) efficiently to obtain …