Attention mechanisms in computer vision: A survey

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com
Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Efficient multi-scale attention module with cross-spatial learning

D Ouyang, S He, G Zhang, M Luo… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Remarkable effectiveness of the channel or spatial attention mechanisms for producing
more discernible feature representation are illustrated in various computer vision tasks …

GhostNetv2: Enhance cheap operation with long-range attention

Y Tang, K Han, J Guo, C Xu, C Xu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Light-weight convolutional neural networks (CNNs) are specially designed for applications
on mobile devices with faster inference speed. The convolutional operation can only capture …

Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles

H Li, J Li, H Wei, Z Liu, Z Zhan, Q Ren - arxiv preprint arxiv:2206.02424, 2022 - arxiv.org
Object detection is a significant downstream task in computer vision. For the on-board edge
computing platforms, a giant model is difficult to achieve the real-time detection requirement …

Deep model reassembly

X Yang, D Zhou, S Liu, J Ye… - Advances in neural …, 2022 - proceedings.neurips.cc
In this paper, we explore a novel knowledge-transfer task, termed as Deep Model
Reassembly (DeRy), for general-purpose model reuse. Given a collection of heterogeneous …

A lightweight vehicles detection network model based on YOLOv5

X Dong, S Yan, C Duan - Engineering Applications of Artificial Intelligence, 2022 - Elsevier
Vehicle detection technology is of great significance for realizing automatic monitoring and
AI-assisted driving systems. The state-of-the-art object detection method, namely, a class of …

Ds-transunet: Dual swin transformer u-net for medical image segmentation

A Lin, B Chen, J Xu, Z Zhang, G Lu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Automatic medical image segmentation has made great progress owing to powerful deep
representation learning. Inspired by the success of self-attention mechanism in transformer …

Seaformer: Squeeze-enhanced axial transformer for mobile semantic segmentation

Q Wan, Z Huang, J Lu, YU Gang… - The eleventh international …, 2023 - openreview.net
Since the introduction of Vision Transformers, the landscape of many computer vision tasks
(eg, semantic segmentation), which has been overwhelmingly dominated by CNNs, recently …

Deepvit: Towards deeper vision transformer

D Zhou, B Kang, X **, L Yang, X Lian, Z Jiang… - arxiv preprint arxiv …, 2021 - arxiv.org
Vision transformers (ViTs) have been successfully applied in image classification tasks
recently. In this paper, we show that, unlike convolution neural networks (CNNs) that can be …