- Academic Search

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Zapisz Cytuj Cytowane przez 142 Powiązane artykuły Wszystkie wersje 9

[Free GPT-4]

[PDF] nowpublishers.com

[PDF][PDF] YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems

CY Wang, HYM Liao - APSIPA Transactions on Signal and …, 2024 - nowpublishers.com

This is a comprehensive review of the YOLO series of systems. Different from previous
literature surveys, this review article reexamines the characteristics of the YOLO series from …

Zapisz Cytuj Cytowane przez 14 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] neurips.cc

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2023 - proceedings.neurips.cc

Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

Zapisz Cytuj Cytowane przez 126 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] thecvf.com

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

Zapisz Cytuj Cytowane przez 165 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] thecvf.com

Cut and learn for unsupervised object detection and instance segmentation

X Wang, R Girdhar, SX Yu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Abstract We propose Cut-and-LEaRn (CutLER), a simple approach for training
unsupervised object detection and segmentation models. We leverage the property of self …

Zapisz Cytuj Cytowane przez 179 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] thecvf.com

Images speak in images: A generalist painter for in-context visual learning

X Wang, W Wang, Y Cao, C Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com

In-context learning, as a new paradigm in NLP, allows the model to rapidly adapt to various
tasks with only a handful of prompts and examples. But in computer vision, the difficulties for …

Zapisz Cytuj Cytowane przez 240 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]

[PDF] arxiv.org

Seggpt: Segmenting everything in context

X Wang, X Zhang, Y Cao, W Wang, C Shen… - arxiv preprint arxiv …, 2023 - arxiv.org

We present SegGPT, a generalist model for segmenting everything in context. We unify
various segmentation tasks into a generalist in-context learning framework that …

Zapisz Cytuj Cytowane przez 252 Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]

[PDF] thecvf.com

Masked-attention mask transformer for universal image segmentation

B Cheng, I Misra, AG Schwing… - Proceedings of the …, 2022 - openaccess.thecvf.com

Image segmentation groups pixels with different semantics, eg, category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …

Zapisz Cytuj Cytowane przez 2358 Powiązane artykuły Wszystkie wersje 7 Wersja HTML

[Free GPT-4]

[PDF] ieee.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Zapisz Cytuj Cytowane przez 119 Powiązane artykuły Wszystkie wersje 3

[Free GPT-4]

[PDF] thecvf.com

Conditional detr for fast training convergence

D Meng, X Chen, Z Fan, G Zeng, H Li… - Proceedings of the …, 2021 - openaccess.thecvf.com

The recently-developed DETR approach applies the transformer encoder and decoder
architecture to object detection and achieves promising performance. In this paper, we …

Zapisz Cytuj Cytowane przez 738 Powiązane artykuły Wszystkie wersje 6 Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Solov2: Dynamic and fast instance segmentation

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

[PDF][PDF] YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Universal instance perception as object discovery and retrieval

Cut and learn for unsupervised object detection and instance segmentation

Images speak in images: A generalist painter for in-context visual learning

Seggpt: Segmenting everything in context

Masked-attention mask transformer for universal image segmentation

Transformer-based visual segmentation: A survey

Conditional detr for fast training convergence