Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …

Vehicle detection for autonomous driving: A review of algorithms and datasets

J Karangwa, J Liu, Z Zeng - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org
Nowadays, vehicles with a high level of automation are being driven everywhere. With the
apparent success of autonomous driving technology, we keep working to achieve fully …

Towards end-to-end unified scene text detection and layout analysis

S Long, S Qin, D Panteleev… - Proceedings of the …, 2022 - openaccess.thecvf.com
Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …

Remax: Relaxing for better training on efficient panoptic segmentation

S Sun, W Wang, A Howard, Q Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper presents a new mechanism to facilitate the training of mask transformers for
efficient panoptic segmentation, democratizing its deployment. We observe that due to the …

Moat: Alternating mobile convolution and attention brings strong vision models

C Yang, S Qiao, Q Yu, X Yuan, Y Zhu… - The Eleventh …, 2022 - openreview.net
This paper presents MOAT, a family of neural networks that build on top of MObile
convolution (ie, inverted residual blocks) and ATtention. Unlike the current works that stack …

Tubeformer-deeplab: Video mask transformer

D Kim, J **e, H Wang, S Qiao, Q Yu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We present TubeFormer-DeepLab, the first attempt to tackle multiple core video
segmentation tasks in a unified manner. Different video segmentation tasks (eg, video …

A review of plant disease detection systems for farming applications

MSP Ngongoma, M Kabeya, K Moloi - Applied Sciences, 2023 - mdpi.com
The globe and more particularly the economically developed regions of the world are
currently in the era of the Fourth Industrial Revolution (4IR). Conversely, the economically …

k-means Mask Transformer

Q Yu, H Wang, S Qiao, M Collins, Y Zhu… - … on Computer Vision, 2022 - Springer
The rise of transformers in vision tasks not only advances network backbone designs, but
also starts a brand-new page to achieve end-to-end image recognition (eg., object detection …

Polymax: General dense prediction with mask transformer

X Yang, L Yuan, K Wilber, A Sharma… - Proceedings of the …, 2024 - openaccess.thecvf.com
Dense prediction tasks, such as semantic segmentation, depth estimation, and surface
normal prediction, can be easily formulated as per-pixel classification (discrete outputs) or …

Waymo open dataset: Panoramic video panoptic segmentation

J Mei, AZ Zhu, X Yan, H Yan, S Qiao, LC Chen… - … on Computer Vision, 2022 - Springer
Panoptic image segmentation is the computer vision task of finding groups of pixels in an
image and assigning semantic classes and object instance identifiers to them. Research in …