Semantic image segmentation: Two decades of research

G Csurka, R Volpi, B Chidlovskii - Foundations and Trends® …, 2022 - nowpublishers.com
Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer
vision applications, providing key information for the global understanding of an image. This …

A comprehensive review of modern object segmentation approaches

Y Wang, U Ahsan, H Li, M Hagen - Foundations and Trends® …, 2022 - nowpublishers.com
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …

Max-deeplab: End-to-end panoptic segmentation with mask transformers

H Wang, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present MaX-DeepLab, the first end-to-end model for panoptic segmentation.
Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks …

Axial-deeplab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution

S Qiao, LC Chen, A Yuille - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Many modern object detectors demonstrate outstanding performances by using the
mechanism of looking and thinking twice. In this paper, we explore this mechanism in the …

Higherhrnet: Scale-aware representation learning for bottom-up human pose estimation

B Cheng, B **ao, J Wang, H Shi… - Proceedings of the …, 2020 - openaccess.thecvf.com
Bottom-up human pose estimation methods have difficulties in predicting the correct pose for
small persons due to challenges in scale variation. In this paper, we present HigherHRNet: a …

Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

Hierarchical multi-scale attention for semantic segmentation

A Tao, K Sapra, B Catanzaro - arxiv preprint arxiv:2005.10821, 2020 - arxiv.org
Multi-scale inference is commonly used to improve the results of semantic segmentation.
Multiple images scales are passed through a network and then the results are combined …

You only segment once: Towards real-time panoptic segmentation

J Hu, L Huang, T Ren, S Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose YOSO, a real-time panoptic segmentation framework. YOSO
predicts masks via dynamic convolutions between panoptic kernels and image feature …

Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …