Deep learning-based human pose estimation: A survey

C Zheng, W Wu, C Chen, T Yang, S Zhu, J Shen… - ACM Computing …, 2023 - dl.acm.org
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …

Recent advances of monocular 2D and 3D human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Not all tokens are equal: Human-centric visual analysis via token clustering transformer

W Zeng, S **, W Liu, C Qian, P Luo… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision transformers have achieved great successes in many computer vision tasks. Most
methods generate vision tokens by splitting an image into a regular and fixed grid and …

Neural architecture search for spiking neural networks

Y Kim, Y Li, H Park, Y Venkatesha, P Panda - European conference on …, 2022 - Springer
Abstract Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-
efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent …

Tokenlearner: What can 8 learned tokens do for images and videos?

MS Ryoo, AJ Piergiovanni, A Arnab… - arxiv preprint arxiv …, 2021 - arxiv.org
In this paper, we introduce a novel visual representation learning which relies on a handful
of adaptively learned tokens, and which is applicable to both image and video …

Vision-based human pose estimation via deep learning: A survey

G Lan, Y Wu, F Hu, Q Hao - IEEE Transactions on Human …, 2022 - ieeexplore.ieee.org
Human pose estimation (HPE) has attracted a significant amount of attention from the
computer vision community in the past decades. Moreover, HPE has been applied to various …

Faster voxelpose: Real-time 3d human pose estimation by orthographic projection

H Ye, W Zhu, C Wang, R Wu, Y Wang - European Conference on …, 2022 - Springer
While the voxel-based methods have achieved promising results for multi-person 3D pose
estimation from multi-cameras, they suffer from heavy computation burdens, especially for …

Pocketnet: Extreme lightweight face recognition network using neural architecture search and multistep knowledge distillation

F Boutros, P Siebke, M Klemt, N Damer… - IEEE …, 2022 - ieeexplore.ieee.org
Deep neural networks have rapidly become the mainstream method for face recognition
(FR). However, this limits the deployment of such models that contain an extremely large …

Zoomnas: searching for whole-body human pose estimation in the wild

L Xu, S **, W Liu, C Qian, W Ouyang… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
This paper investigates the task of 2D whole-body human pose estimation, which aims to
localize dense landmarks on the entire human body including body, feet, face, and hands …

Pose for everything: Towards category-agnostic pose estimation

L Xu, S **, W Zeng, W Liu, C Qian, W Ouyang… - European conference on …, 2022 - Springer
Existing works on 2D pose estimation mainly focus on a certain category, eg human, animal,
and vehicle. However, there are lots of application scenarios that require detecting the …