Recent advances of monocular 2d and 3d human pose estimation: A deep learning perspective

W Liu, Q Bao, Y Sun, T Mei - ACM Computing Surveys, 2022 - dl.acm.org
Estimation of the human pose from a monocular camera has been an emerging research
topic in the computer vision community with many applications. Recently, benefiting from the …

Human pose estimation and its application to action recognition: A survey

L Song, G Yu, J Yuan, Z Liu - Journal of Visual Communication and Image …, 2021 - Elsevier
Human pose estimation aims at predicting the poses of human body parts in images or
videos. Since pose motions are often driven by some specific human actions, knowing the …

Crosstransformers: spatially-aware few-shot transfer

C Doersch, A Gupta… - Advances in Neural …, 2020 - proceedings.neurips.cc
Given new tasks with very little data---such as new classes in a classification problem or a
domain shift in the input---performance of modern vision systems degrades remarkably …

Monocular human pose estimation: A survey of deep learning-based methods

Y Chen, Y Tian, M He - Computer vision and image understanding, 2020 - Elsevier
Vision-based monocular human pose estimation, as one of the most fundamental and
challenging problems in computer vision, aims to obtain posture of the human body from …

Vnect: Real-time 3d human pose estimation with a single rgb camera

D Mehta, S Sridhar, O Sotnychenko, H Rhodin… - Acm transactions on …, 2017 - dl.acm.org
We present the first real-time method to capture the full global 3D skeletal pose of a human
in a stable, temporally consistent manner using a single RGB camera. Our method combines …

Temporal segment networks for action recognition in videos

L Wang, Y **ong, Z Wang, Y Qiao, D Lin… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
We present a general and flexible video-level framework for learning action models in
videos. This method, called temporal segment network (TSN), aims to model long-range …

Deepfashion: Powering robust clothes recognition and retrieval with rich annotations

Z Liu, P Luo, S Qiu, X Wang… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
Recent advances in clothes recognition have been driven by the construction of clothes
datasets. Existing datasets are limited in the amount of annotations and are difficult to cope …

Graph stacked hourglass networks for 3d human pose estimation

T Xu, W Takano - Proceedings of the IEEE/CVF conference …, 2021 - openaccess.thecvf.com
In this paper, we propose a novel graph convolutional network architecture, Graph Stacked
Hourglass Networks, for 2D-to-3D human pose estimation tasks. The proposed architecture …

Ordinal depth supervision for 3d human pose estimation

G Pavlakos, X Zhou, K Daniilidis - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Our ability to train end-to-end systems for 3D human pose estimation from single images is
currently constrained by the limited availability of 3D annotations for natural images. Most …

Survey on emotional body gesture recognition

F Noroozi, CA Corneanu, D Kamińska… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
Automatic emotion recognition has become a trending research topic in the past decade.
While works based on facial expressions or speech abound, recognizing affect from body …