Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Feature dimensionality reduction: a review

W Jia, M Sun, J Lian, S Hou - Complex & Intelligent Systems, 2022 - Springer
As a topic of basic research, the “curse of dimensionality” has received increasing attention, since it raises the cost of data storage and computation; it also …
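To make this concrete, here is a minimal sketch of one classic dimensionality reduction technique in this space, a PCA-style projection computed with NumPy via SVD; the array sizes and variable names are illustrative assumptions rather than anything taken from the review.

```python
import numpy as np

# Toy feature matrix: 200 samples with 512-dimensional features (illustrative sizes).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 512))

# Center the features, then project onto the top-k principal components via SVD.
k = 16
X_centered = X - X.mean(axis=0, keepdims=True)
U, S, Vt = np.linalg.svd(X_centered, full_matrices=False)
X_reduced = X_centered @ Vt[:k].T  # shape (200, 16): compressed representation

print(X_reduced.shape)
```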

Run, don't walk: chasing higher FLOPS for faster neural networks

J Chen, S Kao, H He, W Zhuo, S Wen… - Proceedings of the …, 2023 - openaccess.thecvf.com
To design fast neural networks, many works have been focusing on reducing the number of
floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does …
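A minimal sketch of the underlying observation, under assumed toy workloads: two computations with the same nominal FLOP count can show very different wall-clock latency, because memory access patterns and parallelism also matter. The shapes and the simple timing harness below are illustrative, not taken from the paper.

```python
import time
import numpy as np

rng = np.random.default_rng(0)

def best_time(fn, repeats=5):
    fn()  # warm-up run
    best = float("inf")
    for _ in range(repeats):
        t0 = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - t0)
    return best

# Both workloads perform roughly 2 * 1024^3 floating-point operations.
A = rng.normal(size=(1024, 1024)).astype(np.float32)
B = rng.normal(size=(1024, 1024)).astype(np.float32)
As = rng.normal(size=(512, 128, 128)).astype(np.float32)
Bs = rng.normal(size=(512, 128, 128)).astype(np.float32)

t_big = best_time(lambda: A @ B)      # one large matrix multiply
t_small = best_time(lambda: As @ Bs)  # 512 small batched multiplies

# Equal FLOPs, but the measured latencies typically differ noticeably.
print(f"one large matmul: {t_big:.4f}s, many small matmuls: {t_small:.4f}s")
```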

Vision Mamba: Efficient visual representation learning with bidirectional state space model

L Zhu, B Liao, Q Zhang, X Wang, W Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, state space models (SSMs) with efficient hardware-aware designs, i.e., the
Mamba deep learning model, have shown great potential for long sequence modeling …
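For orientation, the snippet below is a minimal sketch of the linear state space recurrence that SSM-based models build on, with fixed (non-input-dependent) parameters and toy dimensions chosen for illustration; it is not the Mamba architecture itself, which additionally uses selective, hardware-aware scans.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sizes: a sequence of 64 tokens, a scalar input channel, a 16-dim hidden state.
seq_len, state_dim = 64, 16
u = rng.normal(size=seq_len)  # input sequence u_1..u_T

# Fixed SSM parameters; selective SSMs instead make these input-dependent.
A = np.diag(np.exp(-rng.uniform(0.1, 1.0, size=state_dim)))  # stable decay
B = rng.normal(size=state_dim)
C = rng.normal(size=state_dim)

x = np.zeros(state_dim)
y = np.empty(seq_len)
for t in range(seq_len):
    x = A @ x + B * u[t]  # state update: x_t = A x_{t-1} + B u_t
    y[t] = C @ x          # readout:     y_t = C x_t

print(y.shape)
```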

SwinFusion: Cross-domain long-range learning for general image fusion via Swin Transformer

J Ma, L Tang, F Fan, J Huang, X Mei… - IEEE/CAA Journal of …, 2022 - ieeexplore.ieee.org
This study proposes a novel general image fusion framework based on cross-domain long-
range learning and the Swin Transformer, termed SwinFusion. On the one hand, an attention …

Vision GNN: An image is worth graph of nodes

K Han, Y Wang, J Guo, Y Tang… - Advances in neural …, 2022 - proceedings.neurips.cc
Network architecture plays a key role in deep learning-based computer vision systems.
The widely used convolutional neural network and transformer treat the image as a grid or …
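As a hedged sketch of the graph-of-nodes view (toy sizes and a plain mean-aggregation step chosen for illustration, not the paper's exact architecture): patch features become graph nodes, edges connect each patch to its most similar patches, and features are updated by aggregating over neighbors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "image": a 14 x 14 grid of patches, each with a 64-dim feature (illustrative).
num_patches, dim, k = 14 * 14, 64, 8
feats = rng.normal(size=(num_patches, dim))

# Connect every patch to its k most similar patches (dot product on normalized
# features), giving a graph instead of a grid or a sequence.
normed = feats / np.linalg.norm(feats, axis=1, keepdims=True)
sims = normed @ normed.T
np.fill_diagonal(sims, -np.inf)               # exclude self-loops
neighbors = np.argsort(sims, axis=1)[:, -k:]  # (196, 8) neighbor indices

# One round of simple mean aggregation over graph neighbors (a GCN-style step).
aggregated = feats[neighbors].mean(axis=1)
updated = feats + aggregated                  # residual node update

print(updated.shape)
```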

Scaling up your kernels to 31x31: Revisiting large kernel design in CNNs

X Ding, X Zhang, J Han, G Ding - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …
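A minimal sketch of the kind of layer involved, assuming PyTorch and illustrative channel/resolution sizes: a depthwise convolution with a 31x31 kernel, where each channel is filtered separately to keep parameter and FLOP cost manageable. This is a simplified stand-in, not the full architecture proposed in the paper.

```python
import torch
import torch.nn as nn

# Depthwise 31x31 convolution: groups=dim gives one large spatial filter per channel.
dim = 64
large_kernel = nn.Conv2d(dim, dim, kernel_size=31, padding=15, groups=dim)

x = torch.randn(1, dim, 56, 56)
y = large_kernel(x)
print(y.shape)  # torch.Size([1, 64, 56, 56]); padding=15 preserves spatial size
```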

Visual attention network

MH Guo, CZ Lu, ZN Liu, MM Cheng, SM Hu - Computational Visual Media, 2023 - Springer
While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …

An effective CNN and Transformer complementary network for medical image segmentation

F Yuan, Z Zhang, Z Fang - Pattern Recognition, 2023 - Elsevier
The Transformer network was originally proposed for natural language processing. Due to
its powerful ability to model long-range dependencies, it has been extended for …

A convnet for the 2020s

Z Liu, H Mao, CY Wu, C Feichtenhofer… - Proceedings of the …, 2022 - openaccess.thecvf.com
The" Roaring 20s" of visual recognition began with the introduction of Vision Transformers
(ViTs), which quickly superseded ConvNets as the state-of-the-art image classification …