Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

J Li, J Chen, Y Tang, C Wang, BA Landman… - Medical Image …, 2023 - Elsevier
Transformer, one of the latest technological advances of deep learning, has gained
prevalence in natural language processing and computer vision. Since medical imaging bears …

Vmamba: Visual state space model

Y Liu, Y Tian, Y Zhao, H Yu, L Xie… - Advances in neural …, 2025 - proceedings.neurips.cc
Designing computationally efficient network architectures remains an ongoing necessity in
computer vision. In this paper, we adapt Mamba, a state-space language model, into …

Run, don't walk: chasing higher FLOPS for faster neural networks

J Chen, S Kao, H He, W Zhuo, S Wen… - Proceedings of the …, 2023 - openaccess.thecvf.com
To design fast neural networks, many works have been focusing on reducing the number of
floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does …

Efficientvit: Memory efficient vision transformer with cascaded group attention

X Liu, H Peng, N Zheng, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Vision transformers have shown great success due to their high model capabilities.
However, their remarkable performance is accompanied by heavy computation costs, which …

Large selective kernel network for remote sensing object detection

Y Li, Q Hou, Z Zheng, MM Cheng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent research on remote sensing object detection has largely focused on improving the
representation of oriented bounding boxes but has overlooked the unique prior knowledge …

Rwkv: Reinventing rnns for the transformer era

B Peng, E Alcaide, Q Anthony, A Albalak… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have revolutionized almost all natural language processing (NLP) tasks but
suffer from memory and computational complexity that scales quadratically with sequence …

Repvit: Revisiting mobile cnn from vit perspective

A Wang, H Chen, Z Lin, J Han… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recently, lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …

Unireplknet: A universal perception large-kernel convnet for audio, video, point cloud, time-series and image recognition

X Ding, Y Zhang, Y Ge, S Zhao… - Proceedings of the …, 2024 - openaccess.thecvf.com
Large-kernel convolutional neural networks (ConvNets) have recently received extensive
research attention, but two unresolved and critical issues demand further investigation. 1) …

Internimage: Exploring large-scale vision foundation models with deformable convolutions

W Wang, J Dai, Z Chen, Z Huang, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Compared to the great progress of large-scale vision transformers (ViTs) in recent years,
large-scale models based on convolutional neural networks (CNNs) are still in an early …