Review the state-of-the-art technologies of semantic segmentation based on deep learning

Y Mo, Y Wu, X Yang, F Liu, Y Liao - Neurocomputing, 2022 - Elsevier
The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …

Image segmentation using deep learning: A survey

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

Segnext: Rethinking convolutional attention design for semantic segmentation

MH Guo, CZ Lu, Q Hou, Z Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …

Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …

PIDNet: A real-time semantic segmentation network inspired by PID controllers

J Xu, Z **ong, SP Bhattacharyya - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Two-branch network architecture has shown its efficiency and effectiveness in real-time
semantic segmentation tasks. However, direct fusion of high-resolution details and low …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes

H Pan, Y Hong, W Sun, Y Jia - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
Using light-weight architectures or reasoning on low-resolution images, recent methods
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …

Diffusionclip: Text-guided diffusion models for robust image manipulation

G Kim, T Kwon, JC Ye - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Recently, GAN inversion methods combined with Contrastive Language-Image Pretraining
(CLIP) enables zero-shot image manipulation guided by text prompts. However, their …

Sam-adapter: Adapting segment anything in underperformed scenes

T Chen, L Zhu, C Deng, R Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com
The emergence of large models, also known as foundation models, has brought significant
advancements to AI research. One such model is Segment Anything (SAM), which is …

Topformer: Token pyramid transformer for mobile semantic segmentation

W Zhang, Z Huang, G Luo, T Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Although vision transformers (ViTs) have achieved great success in computer vision, the
heavy computational cost hampers their applications to dense prediction tasks such as …