- Academic Search

W Gu, S Bai, L Kong - Image and Vision Computing, 2022 - Elsevier

Image instance segmentation involves labeling pixels of images with classes and instances,
which is one of the pivotal technologies in many domains, such as natural scenes …

Save Cite Cited by 169 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Masked-attention mask transformer for universal image segmentation

B Cheng, I Misra, AG Schwing… - Proceedings of the …, 2022 - openaccess.thecvf.com

Image segmentation groups pixels with different semantics, eg, category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …

Save Cite Cited by 2374 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] thecvf.com

Mask dino: Towards a unified transformer-based framework for object detection and segmentation

F Li, H Zhang, H Xu, S Liu, L Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper we present Mask DINO, a unified object detection and segmentation
framework. Mask DINO extends DINO (DETR with Improved Denoising Anchor Boxes) by …

Save Cite Cited by 422 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] thecvf.com

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Save Cite Cited by 166 Related articles All 5 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] mdpi.com

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …

Save Cite Cited by 455 Related articles All 22 versions Free GPT-4

[Free GPT-4]

[PDF] thecvf.com

Detrs with hybrid matching

D Jia, Y Yuan, H He, X Wu, H Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com

One-to-one set matching is a key design for DETR to establish its end-to-end capability, so
that object detection does not require a hand-crafted NMS (non-maximum suppression) to …

Save Cite Cited by 221 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ieee.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Save Cite Cited by 122 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Maptrv2: An end-to-end framework for online vectorized hd map construction

B Liao, S Chen, Y Zhang, B Jiang, Q Zhang… - International Journal of …, 2024 - Springer

High-definition (HD) map provides abundant and precise static environmental information of
the driving scene, serving as a fundamental and indispensable component for planning in …

Save Cite Cited by 94 Related articles All 2 versions Free GPT-4

[Free GPT-4]

[HTML] ieee-jas.net

[HTML][HTML] Coarse-to-fine video instance segmentation with factorized conditional appearance flows

Z Qin, X Lu, X Nie, D Liu, Y Yin, W Wang - IEEE/CAA Journal of …, 2023 - ieee-jas.net

We introduce a novel method using a new generative model that automatically learns
effective representations of the target and background appearance to detect, segment and …

Save Cite Cited by 70 Related articles All 4 versions Free GPT-4 Cached

[Free GPT-4]

[PDF] arxiv.org

Maptr: Structured modeling and learning for online vectorized hd map construction

B Liao, S Chen, X Wang, T Cheng, Q Zhang… - arxiv preprint arxiv …, 2022 - arxiv.org

High-definition (HD) map provides abundant and precise environmental information of the
driving scene, serving as a fundamental and indispensable component for planning in …

Save Cite Cited by 221 Related articles All 3 versions Free GPT-4 View as HTML

Create alert

Cite

Advanced search

Saved to My library

Instances as queries

A review on 2D instance segmentation based on deep neural networks

Masked-attention mask transformer for universal image segmentation

Mask dino: Towards a unified transformer-based framework for object detection and segmentation

Universal instance perception as object discovery and retrieval

A survey of visual transformers

Detrs with hybrid matching

Transformer-based visual segmentation: A survey

Maptrv2: An end-to-end framework for online vectorized hd map construction

[HTML][HTML] Coarse-to-fine video instance segmentation with factorized conditional appearance flows

Maptr: Structured modeling and learning for online vectorized hd map construction