Techniques and challenges of image segmentation: A review

Y Yu, C Wang, Q Fu, R Kou, F Huang, B Yang, T Yang… - Electronics, 2023 - mdpi.com
Image segmentation, which has become a research hotspot in the field of image processing
and computer vision, refers to the process of dividing an image into meaningful and non …

Attention mechanisms in computer vision: A survey

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

Lisa: Reasoning segmentation via large language model

X Lai, Z Tian, Y Chen, Y Li, Y Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Although perception systems have made remarkable advancements in recent years they still
rely on explicit human instruction or pre-defined categories to identify the target objects …

Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes

H Pan, Y Hong, W Sun, Y Jia - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
Using light-weight architectures or reasoning on low-resolution images, recent methods
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …

Scale-mae: A scale-aware masked autoencoder for multiscale geospatial representation learning

CJ Reed, R Gupta, S Li, S Brockman… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large, pretrained models are commonly finetuned with imagery that is heavily augmented to
mimic different conditions and scales, with the resulting models used for various tasks with …

Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery

Y Cong, S Khanna, C Meng, P Liu… - Advances in …, 2022 - proceedings.neurips.cc
Unsupervised pre-training methods for large vision models have shown to enhance
performance on downstream supervised tasks. Develo** similar techniques for satellite …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …

Hrda: Context-aware high-resolution domain-adaptive semantic segmentation

L Hoyer, D Dai, L Van Gool - European conference on computer vision, 2022 - Springer
Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source
domain (eg synthetic data) to the target domain (eg real-world data) without requiring further …

SegFormer: Simple and efficient design for semantic segmentation with transformers

E **e, W Wang, Z Yu, A Anandkumar… - Advances in neural …, 2021 - proceedings.neurips.cc
We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …