- Academic Search

Deep learning technique for human parsing: A survey and outlook

L Yang, W Jia, S Li, Q Song - International Journal of Computer Vision, 2024 - Springer

Human parsing aims to partition humans in image or video into multiple pixel-level semantic
parts. In the last decade, it has gained significantly increased interest in the computer vision …

Cited by 21

Transformer-based dual relation graph for multi-label image recognition

J Zhao, K Yan, Y Zhao, X Guo… - Proceedings of the …, 2021 - openaccess.thecvf.com

The simultaneous recognition of multiple objects in one image remains a challenging task,
spanning multiple events in the recognition field such as various object scales, inconsistent …

Cited by 109

Bilateral attention network for RGB-D salient object detection

Z Zhang, Z Lin, J Xu, WD Jin, SP Lu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

RGB-D salient object detection (SOD) aims to segment the most attractive objects in a pair of
cross-modal RGB and depth images. Currently, most existing RGB-D SOD methods focus on …

Cited by 199

Complementary trilateral decoder for fast and accurate salient object detection

Z Zhao, C Xia, C Xie, J Li - Proceedings of the 29th ACM international …, 2021 - dl.acm.org

Salient object detection (SOD) has made great progress, but most of existing SOD methods
focus more on performance than efficiency. Besides, the U-shape structure exists some …

Cited by 109

Logicseg: Parsing visual semantics with neural logic learning and reasoning

L Li, W Wang, Y Yang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Current high-performance semantic segmentation models are purely data-driven sub-symbolic
approaches and blind to the structured nature of the visual world. This is in stark …

Cited by 33

PGDENet: Progressive guided fusion and depth enhancement network for RGB-D indoor scene parsing

W Zhou, E Yang, J Lei, J Wan… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Scene parsing is a fundamental task in computer vision. Various RGB-D (color and depth)
scene parsing methods based on fully convolutional networks have achieved excellent …

Cited by 78

Part-aware panoptic segmentation

D de Geus, P Meletis, C Lu, X Wen… - Proceedings of the …, 2021 - openaccess.thecvf.com

In this work, we introduce the new scene understanding task of Part-aware Panoptic
Segmentation (PPS), which aims to understand a scene at multiple levels of abstraction, and …

Cited by 66

Semantic hierarchy-aware segmentation

L Li, W Wang, T Zhou, R Quan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world at multiple levels. However …

Cited by 23

Panoptic-partformer: Learning a unified model for panoptic part segmentation

X Li, S Xu, Y Yang, G Cheng, Y Tong, D Tao - European Conference on …, 2022 - Springer

Panoptic Part Segmentation (PPS) aims to unify panoptic segmentation and part
segmentation into one task. Previous work mainly utilizes separated approaches to handle …

Cited by 47

Mg-llava: Towards multi-granularity visual instruction tuning

X Zhao, X Li, H Duan, H Huang, Y Li, K Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Multi-modal large language models (MLLMs) have made significant strides in various visual
understanding tasks. However, the majority of these models are constrained to process low …

Cited by 8

Multi-class part parsing with joint boundary-semantic awareness

Deep learning technique for human parsing: A survey and outlook
