Human pose estimation from monocular images: A comprehensive survey

W Gong, X Zhang, J Gonzàlez, A Sobral, T Bouwmans… - Sensors, 2016 - mdpi.com
Human pose estimation refers to the estimation of the location of body parts and how they
are connected in an image. Human pose estimation from monocular images has wide …

Monocular human pose estimation: A survey of deep learning-based methods

Y Chen, Y Tian, M He - Computer vision and image understanding, 2020 - Elsevier
Vision-based monocular human pose estimation, as one of the most fundamental and
challenging problems in computer vision, aims to obtain posture of the human body from …

End-to-end multi-task learning with attention

S Liu, E Johns, AJ Davison - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
We propose a novel multi-task learning architecture, which allows learning of task-specific
feature-level attention. Our design, the Multi-Task Attention Network (MTAN), consists of a …

Cross-stitch networks for multi-task learning

I Misra, A Shrivastava, A Gupta… - Proceedings of the …, 2016 - openaccess.thecvf.com
Multi-task learning in Convolutional Networks has displayed remarkable success in the field
of recognition. This success can be largely attributed to learning shared representations …

Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition

R Ranjan, VM Patel, R Chellappa - IEEE transactions on …, 2017 - ieeexplore.ieee.org
We present an algorithm for simultaneous face detection, landmarks localization, pose
estimation and gender recognition using deep convolutional neural networks (CNN). The …

Region-based convolutional networks for accurate object detection and segmentation

R Girshick, J Donahue, T Darrell… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
Object detection performance, as measured on the canonical PASCAL VOC Challenge
datasets, plateaued in the final years of the competition. The best-performing methods were …

Hypercolumns for object segmentation and fine-grained localization

B Hariharan, P Arbeláez, R Girshick… - Proceedings of the IEEE …, 2015 - cv-foundation.org
Recognition algorithms based on convolutional networks (CNNs) typically use the output of
the last layer as feature representation. However, the information in this layer may be too …

Dual super-resolution learning for semantic segmentation

L Wang, D Li, Y Zhu, L Tian… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Current state-of-the-art semantic segmentation methods often apply high-resolution input to
attain high performance, which brings large computation budgets and limits their …

Viewpoints and keypoints

S Tulsiani, J Malik - … of the IEEE Conference on Computer …, 2015 - openaccess.thecvf.com
We characterize the problem of pose estimation for rigid objects in terms of determining
viewpoint to explain coarse pose and keypoint prediction to capture the finer details. We …

Pastanet: Toward human activity knowledge engine

YL Li, L Xu, X Liu, X Huang, Y Xu… - Proceedings of the …, 2020 - openaccess.thecvf.com
Existing image-based activity understanding methods mainly adopt direct map**, ie from
image to activity concepts, which may encounter performance bottleneck since the huge …