- Academic Search

Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com

We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …

Cited by: 103 (6 versions)

Vehicle detection for autonomous driving: A review of algorithms and datasets

J Karangwa, J Liu, Z Zeng - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org

Nowadays, vehicles with a high level of automation are being driven everywhere. With the
apparent success of autonomous driving technology, we keep working to achieve fully …

Cited by: 32 (3 versions)

Towards end-to-end unified scene text detection and layout analysis

S Long, S Qin, D Panteleev… - Proceedings of the …, 2022 - openaccess.thecvf.com

Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …

Cited by: 100 (8 versions)

Remax: Relaxing for better training on efficient panoptic segmentation

S Sun, W Wang, A Howard, Q Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc

This paper presents a new mechanism to facilitate the training of mask transformers for
efficient panoptic segmentation, democratizing its deployment. We observe that due to the …

Cited by: 12 (6 versions)

Moat: Alternating mobile convolution and attention brings strong vision models

C Yang, S Qiao, Q Yu, X Yuan, Y Zhu… - The Eleventh …, 2022 - openreview.net

This paper presents MOAT, a family of neural networks that build on top of MObile
convolution (i.e., inverted residual blocks) and ATtention. Unlike the current works that stack …

Cited by: 69 (4 versions)

Tubeformer-deeplab: Video mask transformer

D Kim, J Xie, H Wang, S Qiao, Q Yu… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present TubeFormer-DeepLab, the first attempt to tackle multiple core video
segmentation tasks in a unified manner. Different video segmentation tasks (e.g., video …

Cited by: 48 (6 versions)

A review of plant disease detection systems for farming applications

MSP Ngongoma, M Kabeya, K Moloi - Applied Sciences, 2023 - mdpi.com

The globe and more particularly the economically developed regions of the world are
currently in the era of the Fourth Industrial Revolution (4IR). Conversely, the economically …

Cited by: 18 (4 versions)

k-means Mask Transformer

Q Yu, H Wang, S Qiao, M Collins, Y Zhu… - … on Computer Vision, 2022 - Springer

The rise of transformers in vision tasks not only advances network backbone designs, but
also starts a brand-new page to achieve end-to-end image recognition (e.g., object detection …

Cited by: 62 (4 versions)

Polymax: General dense prediction with mask transformer

X Yang, L Yuan, K Wilber, A Sharma… - Proceedings of the …, 2024 - openaccess.thecvf.com

Dense prediction tasks, such as semantic segmentation, depth estimation, and surface
normal prediction, can be easily formulated as per-pixel classification (discrete outputs) or …

Cited by: 12 (7 versions)

Waymo open dataset: Panoramic video panoptic segmentation

J Mei, AZ Zhu, X Yan, H Yan, S Qiao, LC Chen… - … on Computer Vision, 2022 - Springer

Panoptic image segmentation is the computer vision task of finding groups of pixels in an
image and assigning semantic classes and object instance identifiers to them. Research in …

Cited by: 58 (6 versions)

Deeplab2: A tensorflow library for deep labeling

Cmt-deeplab: Clustering mask transformers for panoptic segmentation
