Attention mechanisms in computer vision: A survey
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …
this observation, attention mechanisms were introduced into computer vision with the aim of …
A review on the attention mechanism of deep learning
Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
Segnext: Rethinking convolutional attention design for semantic segmentation
We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …
segmentation. Recent transformer-based models have dominated the field of se-mantic …
Visual attention network
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …
mechanism has recently taken various computer vision areas by storm. However, the 2D …
2dpass: 2d priors assisted semantic segmentation on lidar point clouds
As camera and LiDAR sensors capture complementary information in autonomous driving,
great efforts have been made to conduct semantic segmentation through multi-modality data …
great efforts have been made to conduct semantic segmentation through multi-modality data …
Semi-supervised semantic segmentation with cross pseudo supervision
In this paper, we study the semi-supervised semantic segmentation problem via exploring
both labeled data and extra unlabeled data. We propose a novel consistency regularization …
both labeled data and extra unlabeled data. We propose a novel consistency regularization …
Segmenter: Transformer for semantic segmentation
Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …
requires contextual information to reach label consensus. In this paper we introduce …
Swin transformer: Hierarchical vision transformer using shifted windows
This paper presents a new vision Transformer, called Swin Transformer, that capably serves
as a general-purpose backbone for computer vision. Challenges in adapting Transformer …
as a general-purpose backbone for computer vision. Challenges in adapting Transformer …
Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers
Most recent semantic segmentation methods adopt a fully-convolutional network (FCN) with
an encoder-decoder architecture. The encoder progressively reduces the spatial resolution …
an encoder-decoder architecture. The encoder progressively reduces the spatial resolution …
Sa-net: Shuffle attention for deep convolutional neural networks
Attention mechanisms, which enable a neural network to accurately focus on all the relevant
elements of the input, have become an essential component to improve the performance of …
elements of the input, have become an essential component to improve the performance of …