Deep learning technique for human parsing: A survey and outlook

L Yang, W Jia, S Li, Q Song - International Journal of Computer Vision, 2024 - Springer
Human parsing aims to partition humans in image or video into multiple pixel-level semantic
parts. In the last decade, it has gained significantly increased interest in the computer vision …

Transformer-based dual relation graph for multi-label image recognition

J Zhao, K Yan, Y Zhao, X Guo… - Proceedings of the …, 2021 - openaccess.thecvf.com
The simultaneous recognition of multiple objects in one image remains a challenging task,
spanning multiple events in the recognition field such as various object scales, inconsistent …

Bilateral attention network for RGB-D salient object detection

Z Zhang, Z Lin, J Xu, WD Jin, SP Lu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
RGB-D salient object detection (SOD) aims to segment the most attractive objects in a pair of
cross-modal RGB and depth images. Currently, most existing RGB-D SOD methods focus on …

Complementary trilateral decoder for fast and accurate salient object detection

Z Zhao, C Xia, C Xie, J Li - Proceedings of the 29th ACM International …, 2021 - dl.acm.org
Salient object detection (SOD) has made great progress, but most of existing SOD methods
focus more on performance than efficiency. Besides, the U-shape structure exists some …

LogicSeg: Parsing visual semantics with neural logic learning and reasoning

L Li, W Wang, Y Yang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Current high-performance semantic segmentation models are purely data-driven
sub-symbolic approaches and blind to the structured nature of the visual world. This is in stark …

PGDENet: Progressive guided fusion and depth enhancement network for RGB-D indoor scene parsing

W Zhou, E Yang, J Lei, J Wan… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Scene parsing is a fundamental task in computer vision. Various RGB-D (color and depth)
scene parsing methods based on fully convolutional networks have achieved excellent …

Part-aware panoptic segmentation

D de Geus, P Meletis, C Lu, X Wen… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this work, we introduce the new scene understanding task of Part-aware Panoptic
Segmentation (PPS), which aims to understand a scene at multiple levels of abstraction, and …

Semantic hierarchy-aware segmentation

L Li, W Wang, T Zhou, R Quan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world at multiple levels. However …

Panoptic-PartFormer: Learning a unified model for panoptic part segmentation

X Li, S Xu, Y Yang, G Cheng, Y Tong, D Tao - European Conference on …, 2022 - Springer
Panoptic Part Segmentation (PPS) aims to unify panoptic segmentation and part
segmentation into one task. Previous work mainly utilizes separated approaches to handle …

MG-LLaVA: Towards multi-granularity visual instruction tuning

X Zhao, X Li, H Duan, H Huang, Y Li, K Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Multi-modal large language models (MLLMs) have made significant strides in various visual
understanding tasks. However, the majority of these models are constrained to process low …