Cmt-deeplab: Clustering mask transformers for panoptic segmentation
Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …
framework for panoptic segmentation designed around clustering. It rethinks the existing …
Vehicle detection for autonomous driving: A review of algorithms and datasets
J Karangwa, J Liu, Z Zeng - IEEE Transactions on Intelligent …, 2023 - ieeexplore.ieee.org
Nowadays, vehicles with a high level of automation are being driven everywhere. With the
apparent success of autonomous driving technology, we keep working to achieve fully …
apparent success of autonomous driving technology, we keep working to achieve fully …
Towards end-to-end unified scene text detection and layout analysis
Scene text detection and document layout analysis have long been treated as two separate
tasks in different image domains. In this paper, we bring them together and introduce the …
tasks in different image domains. In this paper, we bring them together and introduce the …
Remax: Relaxing for better training on efficient panoptic segmentation
This paper presents a new mechanism to facilitate the training of mask transformers for
efficient panoptic segmentation, democratizing its deployment. We observe that due to the …
efficient panoptic segmentation, democratizing its deployment. We observe that due to the …
Moat: Alternating mobile convolution and attention brings strong vision models
This paper presents MOAT, a family of neural networks that build on top of MObile
convolution (ie, inverted residual blocks) and ATtention. Unlike the current works that stack …
convolution (ie, inverted residual blocks) and ATtention. Unlike the current works that stack …
Tubeformer-deeplab: Video mask transformer
Abstract We present TubeFormer-DeepLab, the first attempt to tackle multiple core video
segmentation tasks in a unified manner. Different video segmentation tasks (eg, video …
segmentation tasks in a unified manner. Different video segmentation tasks (eg, video …
A review of plant disease detection systems for farming applications
The globe and more particularly the economically developed regions of the world are
currently in the era of the Fourth Industrial Revolution (4IR). Conversely, the economically …
currently in the era of the Fourth Industrial Revolution (4IR). Conversely, the economically …
k-means Mask Transformer
The rise of transformers in vision tasks not only advances network backbone designs, but
also starts a brand-new page to achieve end-to-end image recognition (eg., object detection …
also starts a brand-new page to achieve end-to-end image recognition (eg., object detection …
Polymax: General dense prediction with mask transformer
Dense prediction tasks, such as semantic segmentation, depth estimation, and surface
normal prediction, can be easily formulated as per-pixel classification (discrete outputs) or …
normal prediction, can be easily formulated as per-pixel classification (discrete outputs) or …
Waymo open dataset: Panoramic video panoptic segmentation
Panoptic image segmentation is the computer vision task of finding groups of pixels in an
image and assigning semantic classes and object instance identifiers to them. Research in …
image and assigning semantic classes and object instance identifiers to them. Research in …