Attention mechanism in neural networks: where it comes and where it goes

D Soydaner - Neural Computing and Applications, 2022 - Springer
A long time ago in the machine learning literature, the idea of incorporating a mechanism
inspired by the human visual system into neural networks was introduced. This idea is …
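
For reference, the mechanism this survey traces is, in its now-standard Transformer form, scaled dot-product attention; a minimal sketch (toy shapes, PyTorch assumed, not code from the survey) is:

```python
import math
import torch

def scaled_dot_product_attention(q, k, v):
    """Minimal scaled dot-product attention.

    q, k, v: (batch, num_tokens, dim). Returns a weighted sum of v,
    weighted by the softmax of the query-key similarities.
    """
    dim = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(dim)  # (batch, tokens, tokens)
    weights = scores.softmax(dim=-1)                   # rows sum to 1
    return weights @ v                                 # (batch, tokens, dim)

# Toy usage: 2 images, 16 tokens each, 64-dim embeddings; self-attention uses q = k = v.
x = torch.randn(2, 16, 64)
print(scaled_dot_product_attention(x, x, x).shape)     # torch.Size([2, 16, 64])
```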

A survey on label-efficient deep image segmentation: Bridging the gap between weak supervision and dense prediction

W Shen, Z Peng, X Wang, H Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The rapid development of deep learning has made great progress in image segmentation,
one of the fundamental tasks of computer vision. However, the current segmentation …

Vision transformers for single image dehazing

Y Song, Z He, H Qian, X Du - IEEE Transactions on Image …, 2023 - ieeexplore.ieee.org
Image dehazing is a representative low-level vision task that estimates latent haze-free
images from hazy images. In recent years, convolutional neural network-based methods …

DaViT: Dual attention vision transformers

M Ding, B Xiao, N Codella, P Luo, J Wang… - European conference on …, 2022 - Springer
In this work, we introduce Dual Attention Vision Transformers (DaViT), a simple yet effective
vision transformer architecture that is able to capture global context while maintaining …
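
DaViT pairs spatial window attention with attention computed across feature channels; as a rough illustration of the channel-wise half of that idea (not the authors' implementation, shapes and scaling are assumptions):

```python
import torch

def channel_self_attention(x):
    """Toy channel-wise self-attention: treat each channel as a token and
    attend across channels instead of spatial positions (illustrative only).

    x: (batch, num_tokens, channels)
    """
    xt = x.transpose(1, 2)                      # (batch, channels, num_tokens)
    scale = xt.size(-1) ** -0.5
    scores = xt @ xt.transpose(-2, -1) * scale  # channel-to-channel similarity
    weights = scores.softmax(dim=-1)
    out = weights @ xt                          # mix information across channels
    return out.transpose(1, 2)                  # back to (batch, tokens, channels)

x = torch.randn(2, 196, 96)                     # e.g. 14x14 tokens, 96 channels
print(channel_self_attention(x).shape)          # torch.Size([2, 196, 96])
```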

DilateFormer: Multi-scale dilated transformer for visual recognition

J Jiao, YM Tang, KY Lin, Y Gao, AJ Ma… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
As a de facto solution, the vanilla Vision Transformers (ViTs) are encouraged to model long-
range dependencies between arbitrary image patches while the global attended receptive …
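
The paper's sliding-window dilated attention restricts each query to a sparsely sampled neighborhood; a simplified PyTorch sketch of that restriction (single head, no projections, not the DilateFormer code) might be:

```python
import torch
import torch.nn.functional as F

def sliding_window_dilated_attention(q, k, v, kernel=3, dilation=2):
    """Toy sliding-window dilated attention: each query position attends only
    to a kernel x kernel neighborhood sampled with the given dilation.

    q, k, v: (batch, channels, height, width); single head, no projections.
    """
    b, c, h, w = q.shape
    pad = dilation * (kernel - 1) // 2
    # Gather the dilated neighborhood of keys/values around every position.
    k_n = F.unfold(k, kernel, dilation=dilation, padding=pad).view(b, c, kernel * kernel, h * w)
    v_n = F.unfold(v, kernel, dilation=dilation, padding=pad).view(b, c, kernel * kernel, h * w)
    q_flat = q.view(b, c, 1, h * w)
    scores = (q_flat * k_n).sum(dim=1, keepdim=True) * c ** -0.5   # (b, 1, k*k, h*w)
    weights = scores.softmax(dim=2)                                # over the neighborhood
    out = (weights * v_n).sum(dim=2)                               # (b, c, h*w)
    return out.view(b, c, h, w)

x = torch.randn(1, 32, 16, 16)
print(sliding_window_dilated_attention(x, x, x).shape)             # torch.Size([1, 32, 16, 16])
```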

MPViT: Multi-path vision transformer for dense prediction

Y Lee, J Kim, J Willette… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Dense computer vision tasks such as object detection and segmentation require effective
multi-scale feature representation for detecting or classifying objects or regions with varying …

N-gram in Swin transformers for efficient lightweight image super-resolution

H Choi, J Lee, J Yang - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
While some studies have proven that Swin Transformer (Swin) with window self-attention
(WSA) is suitable for single image super-resolution (SR), the plain WSA ignores the broad …
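
Window self-attention (WSA) first partitions the feature map into non-overlapping windows and attends within each one; a minimal partition helper in the Swin style (simplified, divisible sizes assumed, not the paper's code) is sketched below:

```python
import torch

def window_partition(x, window_size):
    """Split a feature map into non-overlapping windows; window self-attention
    then runs standard self-attention inside each window independently.

    x: (batch, height, width, channels); height and width are assumed to be
    divisible by window_size in this simplified sketch.
    """
    b, h, w, c = x.shape
    x = x.view(b, h // window_size, window_size, w // window_size, window_size, c)
    windows = x.permute(0, 1, 3, 2, 4, 5).reshape(-1, window_size * window_size, c)
    return windows                          # (batch * num_windows, tokens_per_window, channels)

x = torch.randn(1, 8, 8, 32)
print(window_partition(x, 4).shape)         # torch.Size([4, 16, 32])
```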

Multi-scale high-resolution vision transformer for semantic segmentation

J Gu, H Kwon, D Wang, W Ye, M Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Vision Transformers (ViTs) have emerged with superior performance on computer
vision tasks compared to convolutional neural network (CNN)-based models. However, ViTs …

SPViT: Enabling faster vision transformers via latency-aware soft token pruning

Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun… - European conference on …, 2022 - Springer
Recently, Vision Transformer (ViT) has continuously established new milestones in
the computer vision field, while the high computation and memory cost makes its …
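
Token pruning reduces cost by dropping or merging uninformative tokens between blocks; the sketch below is a simplified score-based variant that keeps the top tokens and folds the rest into one aggregated token (an illustration of the general idea, not SPViT's latency-aware soft pruning):

```python
import torch

def prune_tokens(tokens, scores, keep_ratio=0.5):
    """Illustrative token pruning: keep the highest-scoring tokens and fold the
    rest into one aggregated token so their information is not discarded outright.

    tokens: (batch, num_tokens, dim); scores: (batch, num_tokens), higher = keep.
    """
    num_keep = max(1, int(tokens.size(1) * keep_ratio))
    idx = scores.topk(num_keep, dim=1).indices                           # (batch, num_keep)
    keep = torch.gather(tokens, 1, idx.unsqueeze(-1).expand(-1, -1, tokens.size(-1)))

    # Weight the remaining (pruned) tokens by their scores and merge them.
    keep_mask = torch.zeros_like(scores, dtype=torch.bool)
    keep_mask.scatter_(1, idx, torch.ones_like(idx, dtype=torch.bool))
    w = scores.masked_fill(keep_mask, float("-inf")).softmax(dim=1)      # zero weight on kept tokens
    packed = (w.unsqueeze(-1) * tokens).sum(dim=1, keepdim=True)         # one "package" token
    return torch.cat([keep, packed], dim=1)                              # (batch, num_keep + 1, dim)

x = torch.randn(2, 16, 64)
s = torch.rand(2, 16)
print(prune_tokens(x, s).shape)                                          # torch.Size([2, 9, 64])
```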

Accurate image restoration with attention retractable transformer

J Zhang, Y Zhang, J Gu, Y Zhang, L Kong… - arXiv preprint arXiv …, 2022 - arxiv.org
Recently, Transformer-based image restoration networks have achieved promising
improvements over convolutional neural networks due to parameter-independent global …