- Academic Search

W Jia, L Yang, Z Jia, W Zhao, Y Zhou, Q Song - Neurocomputing, 2023 - Elsevier

In this paper, we introduce TIVE, a Toolbox for Identifying Video instance segmentation
Errors. By directly operating output prediction files, TIVE defines isolated error types and …

Speichern Zitieren Zitiert von: 5 Ähnliche Artikel Alle 4 Versionen

Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video

R Feng, H Chen - Neurocomputing, 2025 - Elsevier

Temporal modeling and spatio-temporal collaboration are pivotal techniques for video-
based human pose estimation. Most state-of-the-art methods adopt optical flow or temporal …

Speichern Zitieren Ähnliche Artikel

Adept: Annotation-denoising auxiliary tasks with discrete cosine transform map and keypoint for human-centric pretraining

W He, Y Yan, S Tang, Y Deng, Y Zhong, P Luo, D Qi - Neurocomputing, 2025 - Elsevier

Human-centric perception is the core of diverse computer vision tasks and has been a long-
standing research focus. However, previous research studied these human-centric tasks …

Speichern Zitieren Ähnliche Artikel

[Free GPT-4]

[PDF] ssrn.com

MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image

X Li, Y Fan, Z Guo, Z Rao, Y Duan, S Liu - Neurocomputing, 2024 - Elsevier

In this paper, we explore reconstructing high-quality clothed 3D humans from a single RGB-
D image, assuming that virtual humans can be represented by front-view and back-view …

Speichern Zitieren Ähnliche Artikel

Pose-guided hierarchical semantic decomposition and composition for human parsing

B Yang, C Yu, JG Yu, C Gao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Human parsing is a fine-grained semantic segmentation task, which needs to understand
human semantic parts. Most existing methods model human parsing as a general semantic …

Speichern Zitieren Zitiert von: 8 Ähnliche Artikel Alle 3 Versionen

Crowded pose-guided multi-task learning for instance-level human parsing

Y Wei, L Liu, X Fu, LJ Liu, W Peng - Machine Vision and Applications, 2023 - Springer

Instance-level human parsing remains challenging due to the similarity between human
instances and background, complex interactions, and various poses. Aiming at assigning …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel Alle 2 Versionen

[Free GPT-4]

[PDF] wiley.com Full View

WNet: A dual‐encoded multi‐human parsing network

MI Hosen, T Aydin, MB Islam - IET Image Processing, 2024 - Wiley Online Library

In recent years, multi‐human parsing has become a focal point in research, yet prevailing
methods often rely on intermediate stages and lacking pixel‐level analysis. Moreover, their …

Speichern Zitieren Ähnliche Artikel Alle 2 Versionen

SP-YOLO: an end-to-end lightweight network for real-time human pose estimation

Y Zhang, Z Wang, M Li, P Gao - Signal, Image and Video Processing, 2024 - Springer

The traditional multi-person human pose estimation method has several problems including
low real-time detection effect, low recognition efficiency, and a large number of calculation …

Speichern Zitieren Zitiert von: 2 Ähnliche Artikel

[Free GPT-4]

[PDF] arxiv.org

Nondiscriminatory treatment: A straightforward framework for multi-human parsing

M Yan, G Zhang, T Zhang, Y Zhang - Neurocomputing, 2021 - Elsevier

Multi-human parsing aims to segment every body part of every human instance. Nearly all
state-of-the-art methods follow the “detection first” or “segmentation first” pipelines. Different …

Speichern Zitieren Zitiert von: 3 Ähnliche Artikel Alle 4 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

SUNNet: A novel framework for simultaneous human parsing and pose estimation

TIVE: A toolbox for identifying video instance segmentation errors

Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video

Adept: Annotation-denoising auxiliary tasks with discrete cosine transform map and keypoint for human-centric pretraining

MaskRecon: High-quality human reconstruction via masked autoencoders using a single RGB-D image

Pose-guided hierarchical semantic decomposition and composition for human parsing

Crowded pose-guided multi-task learning for instance-level human parsing

WNet: A dual‐encoded multi‐human parsing network

SP-YOLO: an end-to-end lightweight network for real-time human pose estimation

Nondiscriminatory treatment: A straightforward framework for multi-human parsing