Transformers in medical imaging: A survey

F Shamshad, S Khan, SW Zamir, MH Khan… - Medical image …, 2023 - Elsevier
Following their unprecedented success on natural language tasks, Transformers have been
successfully applied to several computer vision problems, achieving state-of-the-art results …

Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

J Li, J Chen, Y Tang, C Wang, BA Landman… - Medical image …, 2023 - Elsevier
The Transformer, one of the latest technological advances in deep learning, has gained
prevalence in natural language processing and computer vision. Since medical imaging bears …

Transformers are SSMs: Generalized models and efficient algorithms through structured state space duality

T Dao, A Gu - arXiv preprint arXiv:2405.21060, 2024 - arxiv.org
While Transformers have been the main architecture behind deep learning's success in
language modeling, state-space models (SSMs) such as Mamba have recently been shown …
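The duality this entry refers to can be illustrated with a minimal sketch, assuming scalar SSM parameters a, b, c chosen only for illustration (this is not the paper's SSD algorithm): a linear state-space model evaluated both as a left-to-right recurrence and as multiplication by the equivalent lower-triangular, attention-like matrix.

```python
# Minimal sketch (not the SSD algorithm itself): a 1-D linear SSM
#   x_t = a * x_{t-1} + b * u_t,   y_t = c * x_t
# computed (1) as a recurrence and (2) as y = M u, where M is the
# lower-triangular "attention-like" matrix M[t, s] = c * a^(t-s) * b.
import numpy as np

rng = np.random.default_rng(0)
T = 6                      # sequence length
a, b, c = 0.9, 0.5, 1.3    # assumed scalar SSM parameters, for illustration
u = rng.standard_normal(T)

# (1) Recurrent view: O(T) time, O(1) state.
x, y_rec = 0.0, np.zeros(T)
for t in range(T):
    x = a * x + b * u[t]
    y_rec[t] = c * x

# (2) Matrix ("dual") view: materialize M and apply it in one matmul, O(T^2).
M = np.zeros((T, T))
for t in range(T):
    for s in range(t + 1):
        M[t, s] = c * (a ** (t - s)) * b
y_mat = M @ u

print(np.allclose(y_rec, y_mat))  # True: both views produce the same outputs
```

The same outputs arise from either view; the recurrent form is cheap per step, while the matrix form exposes the attention-like structure the paper builds on.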

Flatten transformer: Vision transformer using focused linear attention

D Han, X Pan, Y Han, S Song… - Proceedings of the …, 2023 - openaccess.thecvf.com
The quadratic computational complexity of self-attention has been a persistent challenge
when applying Transformer models to vision tasks. Linear attention, on the other hand, offers …
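A minimal sketch of the contrast the snippet draws, using a generic kernel feature map (elu(x) + 1, an assumption borrowed from standard linear attention; it is not the focused mapping proposed in this paper):

```python
# Sketch contrasting O(N^2) softmax attention with generic kernelized
# linear attention. phi(x) = elu(x) + 1 is an assumed feature map and is
# NOT the focused mapping introduced in the paper above.
import numpy as np

def softmax_attention(Q, K, V):
    # Materializes the N x N score matrix: quadratic time and memory.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])
    scores -= scores.max(axis=-1, keepdims=True)          # numerical stability
    P = np.exp(scores)
    P /= P.sum(axis=-1, keepdims=True)
    return P @ V

def linear_attention(Q, K, V, eps=1e-6):
    # Kernel trick: phi(Q) (phi(K)^T V) costs O(N d^2); no N x N matrix.
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))   # elu(x) + 1
    Qf, Kf = phi(Q), phi(K)
    kv = Kf.T @ V                                          # (d, d_v) summary
    z = Qf @ Kf.sum(axis=0)                                # per-query normalizer
    return (Qf @ kv) / (z[:, None] + eps)

rng = np.random.default_rng(0)
N, d = 8, 4
Q, K, V = (rng.standard_normal((N, d)) for _ in range(3))
print(softmax_attention(Q, K, V).shape, linear_attention(Q, K, V).shape)
```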

FlashAttention-3: Fast and accurate attention with asynchrony and low-precision

J Shah, G Bikshandi, Y Zhang… - Advances in …, 2025 - proceedings.neurips.cc
Attention, as a core layer of the ubiquitous Transformer architecture, is the bottleneck for
large language models and long-context applications. FlashAttention elaborated an approach to speed up …

Simplified state space layers for sequence modeling

JTH Smith, A Warrington, SW Linderman - arXiv preprint arXiv:2208.04933, 2022 - arxiv.org
Models using structured state space sequence (S4) layers have achieved state-of-the-art
performance on long-range sequence modeling tasks. An S4 layer combines linear state …
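A minimal sketch of the underlying linear state-space recurrence in generic discrete-time form, with assumed toy matrices A, B, C, D; the S4/S5 parameterizations and the parallel scan used in the paper are not reproduced:

```python
# Minimal sketch of a discrete linear state-space layer, assumed form
#   x_k = A x_{k-1} + B u_k,   y_k = C x_k + D u_k,
# run as a sequential scan over the input sequence.
import numpy as np

def ssm_scan(A, B, C, D, u):
    """u: (T, d_in) input sequence -> (T, d_out) output sequence."""
    x = np.zeros(A.shape[0])
    ys = []
    for k in range(u.shape[0]):
        x = A @ x + B @ u[k]
        ys.append(C @ x + D @ u[k])
    return np.stack(ys)

rng = np.random.default_rng(0)
n, d_in, d_out, T = 4, 3, 2, 10          # assumed state size, widths, length
A = 0.9 * np.eye(n)                      # stable toy state matrix
B = 0.1 * rng.standard_normal((n, d_in))
C = rng.standard_normal((d_out, n))
D = rng.standard_normal((d_out, d_in))
print(ssm_scan(A, B, C, D, rng.standard_normal((T, d_in))).shape)  # (10, 2)
```

Because the recurrence is linear, it can also be computed with a parallel scan or as a convolution, which is what makes such layers attractive for long sequences.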

FlashAttention: Fast and memory-efficient exact attention with IO-awareness

T Dao, D Fu, S Ermon, A Rudra… - Advances in neural …, 2022 - proceedings.neurips.cc
Transformers are slow and memory-hungry on long sequences, since the time and memory
complexity of self-attention are quadratic in sequence length. Approximate attention …
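A minimal sketch of the block-wise, online-softmax idea: it reproduces exact attention without ever materializing the full N x N score matrix, but none of the GPU SRAM/HBM tiling that actually makes FlashAttention fast is modeled here.

```python
# Blockwise exact attention with online-softmax rescaling: only one
# (N, block) tile of scores exists at a time.
import numpy as np

def blockwise_attention(Q, K, V, block=4):
    N, d = Q.shape
    out = np.zeros_like(V)
    m = np.full(N, -np.inf)      # running row-wise max of scores
    l = np.zeros(N)              # running softmax denominator
    for s in range(0, N, block):
        Kb, Vb = K[s:s + block], V[s:s + block]
        scores = Q @ Kb.T / np.sqrt(d)             # current tile only
        m_new = np.maximum(m, scores.max(axis=1))
        alpha = np.exp(m - m_new)                  # rescale previous statistics
        p = np.exp(scores - m_new[:, None])
        l = alpha * l + p.sum(axis=1)
        out = alpha[:, None] * out + p @ Vb
        m = m_new
    return out / l[:, None]

def naive_attention(Q, K, V):
    s = Q @ K.T / np.sqrt(Q.shape[-1])
    p = np.exp(s - s.max(axis=1, keepdims=True))
    return (p / p.sum(axis=1, keepdims=True)) @ V

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((16, 8)) for _ in range(3))
print(np.allclose(blockwise_attention(Q, K, V), naive_attention(Q, K, V)))  # True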

XMem: Long-term video object segmentation with an Atkinson-Shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

MobileLLM: Optimizing sub-billion parameter language models for on-device use cases

Z Liu, C Zhao, F Iandola, C Lai, Y Tian… - … on Machine Learning, 2024 - openreview.net
This paper addresses the growing need for efficient large language models (LLMs) on
mobile devices, driven by increasing cloud costs and latency concerns. We focus on …

Agent attention: On the integration of softmax and linear attention

D Han, T Ye, Y Han, Z Xia, S Pan, P Wan… - … on Computer Vision, 2024 - Springer
The attention module is the key component in Transformers. While the global attention
mechanism offers high expressiveness, its excessive computational cost restricts its …
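A minimal sketch of a two-stage agent attention under stated assumptions: the agent tokens are taken as a strided subsample of the queries (an assumption made only for illustration), and the paper's additional components, such as its pooling scheme and depth-wise convolution, are omitted.

```python
# Sketch of two-stage "agent" attention: a small set of agent tokens first
# attends to the keys/values, then the queries attend to the agents.
# Cost is O(N * n_agents) instead of O(N^2).
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def agent_attention(Q, K, V, n_agents=4):
    d = Q.shape[-1]
    # Assumed choice of agents: a strided subsample of the queries.
    A = Q[:: max(1, Q.shape[0] // n_agents)][:n_agents]
    V_A = softmax(A @ K.T / np.sqrt(d)) @ V      # agents aggregate keys/values
    return softmax(Q @ A.T / np.sqrt(d)) @ V_A   # queries read from agents

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((32, 16)) for _ in range(3))
print(agent_attention(Q, K, V).shape)  # (32, 16)
```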