Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (e.g., body skeleton) from input data such as images and videos. It has drawn …

Recent advances of monocular 2D and 3D human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Lite-HRNet: A lightweight high-resolution network

C Yu, B Xiao, C Gao, L Yuan, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present an efficient high-resolution network, Lite-HRNet, for human pose estimation. We
start by simply applying the efficient shuffle block in ShuffleNet to HRNet (high-resolution …

Bottom-up human pose estimation via disentangled keypoint regression

Z Geng, K Sun, B Xiao, Z Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we are interested in the bottom-up paradigm of estimating human poses from
an image. We study the dense keypoint regression framework that is previously inferior to …

TokenPose: Learning keypoint tokens for human pose estimation

Y Li, S Zhang, Z Wang, S Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Human pose estimation deeply relies on visual clues and anatomical constraints between
parts to locate keypoints. Most existing CNN-based methods do well in visual …

TransPose: Keypoint localization via transformer

S Yang, Z Quan, M Nie, W Yang - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
While CNN-based models have made remarkable progress on human pose estimation,
what spatial dependencies they capture to localize keypoints remains unclear. In this work …

Human pose as compositional tokens

Z Geng, C Wang, Y Wei, Z Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human pose is typically represented by a coordinate vector of body joints or their heatmap
embeddings. While easy for data processing, unrealistic pose estimates are admitted due to …

Deep high-resolution representation learning for visual recognition

J Wang, K Sun, T Cheng, B Jiang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …

Pose recognition with cascade transformers

K Li, S Wang, X Zhang, Y Xu… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we present a regression-based pose recognition method using cascade
Transformers. One way to categorize the existing approaches in this domain is to separate …

Not all tokens are equal: Human-centric visual analysis via token clustering transformer

W Zeng, S Jin, W Liu, C Qian, P Luo… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision transformers have achieved great successes in many computer vision tasks. Most
methods generate vision tokens by splitting an image into a regular and fixed grid and …