Google Académico

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer

Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

Guardar Citar Citado por 1898 Artículos relacionados Las 8 versiones

[Free GPT-4]

[PDF] arxiv.org

Semantic communications for future internet: Fundamentals, applications, and challenges

W Yang, H Du, ZQ Liew, WYB Lim… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org

With the increasing demand for intelligent services, the sixth-generation (6G) wireless
networks will shift from a traditional architecture that focuses solely on a high transmission …

Guardar Citar Citado por 327 Artículos relacionados Las 4 versiones

[Free GPT-4]

[PDF] springer.com

Visual attention network

MH Guo, CZ Lu, ZN Liu, MM Cheng, SM Hu - Computational Visual Media, 2023 - Springer

While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …

Guardar Citar Citado por 793 Artículos relacionados Las 8 versiones

[Free GPT-4]

[HTML] mdpi.com

[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

L Chen, S Li, Q Bai, J Yang, S Jiang, Y Miao - Remote Sensing, 2021 - mdpi.com

Image classification has always been a hot research direction in the world, and the
emergence of deep learning has promoted the development of this field. Convolutional …

Guardar Citar Citado por 598 Artículos relacionados Las 5 versiones En caché

[Free GPT-4]

[PDF] thecvf.com

Cmt: Convolutional neural networks meet vision transformers

J Guo, K Han, H Wu, Y Tang, X Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Vision transformers have been successfully applied to image recognition tasks due to their
ability to capture long-range dependencies within an image. However, there are still gaps in …

Guardar Citar Citado por 866 Artículos relacionados Las 6 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Global attention mechanism: Retain information to enhance channel-spatial interactions

Y Liu, Z Shao, N Hoffmann - arxiv preprint arxiv:2112.05561, 2021 - arxiv.org

A variety of attention mechanisms have been studied to improve the performance of various
computer vision tasks. However, the prior methods overlooked the significance of retaining …

Guardar Citar Citado por 654 Artículos relacionados Las 2 versiones Versión en HTML

[Free GPT-4]

[PDF] mlr.press

Simam: A simple, parameter-free attention module for convolutional neural networks

L Yang, RY Zhang, L Li, X **e - International conference on …, 2021 - proceedings.mlr.press

In this paper, we propose a conceptually simple but very effective attention module for
Convolutional Neural Networks (ConvNets). In contrast to existing channel-wise and spatial …

Guardar Citar Citado por 1357 Artículos relacionados Las 5 versiones Versión en HTML

[Free GPT-4]

[PDF] arxiv.org

Real-time scene text detection with differentiable binarization and adaptive scale fusion

M Liao, Z Zou, Z Wan, C Yao… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Recently, segmentation-based scene text detection methods have drawn extensive attention
in the scene text detection field, because of their superiority in detecting the text instances of …

Guardar Citar Citado por 323 Artículos relacionados Las 7 versiones

A review on the attention mechanism of deep learning

Z Niu, G Zhong, H Yu - Neurocomputing, 2021 - Elsevier

Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …

Guardar Citar Citado por 2406 Artículos relacionados Las 4 versiones

[Free GPT-4]

[PDF] neurips.cc

Vatt: Transformers for multimodal self-supervised learning from raw video, audio and text

H Akbari, L Yuan, R Qian… - Advances in …, 2021 - proceedings.neurips.cc

We present a framework for learning multimodal representations from unlabeled data using
convolution-free Transformer architectures. Specifically, our Video-Audio-Text Transformer …

Guardar Citar Citado por 693 Artículos relacionados Las 9 versiones Versión en HTML

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Residual attention network for image classification

Attention mechanisms in computer vision: A survey

Semantic communications for future internet: Fundamentals, applications, and challenges

Visual attention network

[HTML][HTML] Review of image classification algorithms based on convolutional neural networks

Cmt: Convolutional neural networks meet vision transformers

Global attention mechanism: Retain information to enhance channel-spatial interactions

Simam: A simple, parameter-free attention module for convolutional neural networks

Real-time scene text detection with differentiable binarization and adaptive scale fusion

A review on the attention mechanism of deep learning

Vatt: Transformers for multimodal self-supervised learning from raw video, audio and text