Human pose estimation using deep learning: a systematic literature review

E Samkari, M Arif, M Alghamdi… - Machine Learning and …, 2023 - mdpi.com
Human Pose Estimation (HPE) is the task that aims to predict the location of human joints
from images and videos. This task is used in many applications, such as sports analysis and …

DiffPose: SpatioTemporal diffusion model for video-based human pose estimation

R Feng, Y Gao, THE Tse, X Ma… - Proceedings of the …, 2023 - openaccess.thecvf.com
Denoising diffusion probabilistic models that were initially proposed for realistic image
generation have recently shown success in various perception tasks (eg, object detection …

Joint-Motion Mutual Learning for Pose Estimation in Video

S Wu, H Chen, Y Yin, S Hu, R Feng, Y Jiao… - Proceedings of the …, 2024 - dl.acm.org
Human pose estimation in videos has long been a compelling yet challenging task within
the realm of computer vision. Nevertheless, this task remains difficult because of the …

Human pose-based estimation, tracking and action recognition with deep learning: A survey

L Zhou, X Meng, Z Liu, M Wu, Z Gao… - arxiv preprint arxiv …, 2023 - arxiv.org
Human pose analysis has garnered significant attention within both the research community
and practical applications, owing to its expanding array of uses, including gaming, video …

Building a Strong Pre-Training Baseline for Universal 3D Large-Scale Perception

H Chen, Z Zhang, Y Qu, R Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
An effective pre-training framework with universal 3D representations is extremely desired in
perceiving large-scale dynamic scenes. However establishing such an ideal framework that …

Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning

J Jeong, D Park, KJ Yoon - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Human pose forecasting garners attention for its diverse applications. However challenges
in modeling the multi-modal nature of human motion and intricate interactions among agents …

An improved high-resolution network-based method for yoga-pose estimation

J Li, D Zhang, L Shi, T Ke, C Zhang - Applied Sciences, 2023 - mdpi.com
In this paper, SEPAM_HRNet, a high-resolution pose-estimation model that incorporates the
squeeze-and-excitation and pixel-attention-mask (SEPAM) module is proposed. Feature …

Spectral graphormer: Spectral graph-based transformer for egocentric two-hand reconstruction using multi-view color images

THE Tse, F Mueller, Z Shen, D Tang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a novel transformer-based framework that reconstructs two high fidelity hands
from multi-view RGB images. Unlike existing hand pose estimation methods, where one …

Video-Based Human Pose Regression via Decoupled Space-Time Aggregation

J He, W Yang - Proceedings of the IEEE/CVF Conference …, 2024 - openaccess.thecvf.com
By leveraging temporal dependency in video sequences multi-frame human pose estimation
algorithms have demonstrated remarkable results in complicated situations such as …

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding

Y Wang, Y Wang, P Wu, J Liang, D Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
Despite progress in video-language modeling, the computational challenge of interpreting
long-form videos in response to task-specific linguistic queries persists, largely due to the …