Attention mechanisms in computer vision: A survey

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Convnext v2: Co-designing and scaling convnets with masked autoencoders

S Woo, S Debnath, R Hu, X Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Driven by improved architectures and better representation learning frameworks, the field of
visual recognition has enjoyed rapid modernization and performance boost in the early …

Large selective kernel network for remote sensing object detection

Y Li, Q Hou, Z Zheng, MM Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent research on remote sensing object detection has largely focused on improving the
representation of oriented bounding boxes but has overlooked the unique prior knowledge …

UIU-Net: U-Net in U-Net for infrared small object detection

X Wu, D Hong, J Chanussot - IEEE Transactions on Image …, 2022 - ieeexplore.ieee.org
Learning-based infrared small object detection methods currently rely heavily on the
classification backbone network. This tends to result in tiny object loss and feature …

Visual attention network

MH Guo, CZ Lu, ZN Liu, MM Cheng, SM Hu - Computational Visual Media, 2023 - Springer
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …

On the integration of self-attention and convolution

X Pan, C Ge, R Lu, S Song, G Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Convolution and self-attention are two powerful techniques for representation learning, and
they are usually considered as two peer approaches that are distinct from each other. In this …

Cmt: Convolutional neural networks meet vision transformers

J Guo, K Han, H Wu, Y Tang, X Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision transformers have been successfully applied to image recognition tasks due to their
ability to capture long-range dependencies within an image. However, there are still gaps in …

Global attention mechanism: Retain information to enhance channel-spatial interactions

Y Liu, Z Shao, N Hoffmann - arxiv preprint arxiv:2112.05561, 2021 - arxiv.org
A variety of attention mechanisms have been studied to improve the performance of various
computer vision tasks. However, the prior methods overlooked the significance of retaining …

Coordinate attention for efficient mobile network design

Q Hou, D Zhou, J Feng - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Recent studies on mobile network design have demonstrated the remarkable effectiveness
of channel attention (eg, the Squeeze-and-Excitation attention) for lifting model performance …