Transforming medical imaging with Transformers? A comparative review of key properties, current progresses, and future perspectives

J Li, J Chen, Y Tang, C Wang, BA Landman… - Medical Image …, 2023 - Elsevier
Transformer, one of the latest technological advances in deep learning, has gained
prevalence in natural language processing and computer vision. Since medical imaging bears …

Mamba: Linear-time sequence modeling with selective state spaces

A Gu, T Dao - arXiv preprint arXiv:2312.00752, 2023 - arxiv.org
Foundation models, now powering most of the exciting applications in deep learning, are
almost universally based on the Transformer architecture and its core attention module …
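
For orientation, the linear-time claim comes from replacing attention with a recurrent state space scan whose parameters depend on the input (the "selective" part). A minimal one-channel NumPy sketch under those assumptions; every name here is illustrative, not Mamba's actual implementation:

    import numpy as np

    def selective_ssm_scan(x, a, w_b, w_c):
        """Toy selective state space scan:
            h_t = a * h_{t-1} + b_t * x_t,   y_t = c_t . h_t
        where b_t and c_t are functions of the current input x_t,
        letting the model choose per step what to store or read out.

        x   : (T,) scalar input sequence
        a   : (n,) diagonal state transition (|a| < 1 for stability)
        w_b : (n,) projection giving b_t = w_b * x_t
        w_c : (n,) projection giving c_t = w_c * x_t
        """
        h = np.zeros(a.shape[0])
        ys = []
        for x_t in x:                  # one pass: linear in length T
            b_t = w_b * x_t            # input-dependent input matrix
            c_t = w_c * x_t            # input-dependent readout
            h = a * h + b_t * x_t      # state update
            ys.append(c_t @ h)         # output
        return np.array(ys)

    # usage: 6 steps, 4 state dimensions
    y = selective_ssm_scan(np.random.randn(6), np.full(4, 0.9),
                           np.random.randn(4), np.random.randn(4))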

A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …
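
One representative optimization such surveys cover is key-value caching for autoregressive decoding: keys and values for the prefix are stored rather than recomputed at every step. A minimal framework-agnostic sketch (the function and its signature are illustrative, not any particular library's API):

    import numpy as np

    def decode_step(x_t, W_q, W_k, W_v, kv_cache):
        """One attention step with a KV cache. Only the new token's
        key/value are computed and appended; prior steps are reused.
        x_t: (d,) token embedding; W_*: (d, d); kv_cache: [keys, values]."""
        q = x_t @ W_q
        kv_cache[0].append(x_t @ W_k)          # append new key
        kv_cache[1].append(x_t @ W_v)          # append new value
        K = np.stack(kv_cache[0])              # (t, d) cached keys
        V = np.stack(kv_cache[1])              # (t, d) cached values
        s = K @ q / np.sqrt(len(q))            # (t,) scores
        w = np.exp(s - s.max()); w /= w.sum()  # softmax over the prefix
        return w @ V                           # (d,) attended output

    # usage: decode 3 tokens of width 4
    d = 4
    W = [np.random.randn(d, d) for _ in range(3)]
    cache = [[], []]
    for x_t in np.random.randn(3, d):
        out = decode_step(x_t, *W, cache)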

Flatten transformer: Vision transformer using focused linear attention

D Han, X Pan, Y Han, S Song… - Proceedings of the …, 2023 - openaccess.thecvf.com
The quadratic computational complexity of self-attention has been a persistent challenge
when applying Transformer models to vision tasks. Linear attention, on the other hand, offers …
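
To make the complexity gap concrete: softmax attention materializes an N×N score matrix (O(N²d)), while kernelized linear attention reorders the same product to avoid it (O(Nd²)). A schematic comparison using a generic non-negative feature map, not the paper's focused variant:

    import numpy as np

    def softmax_attention(Q, K, V):
        """Standard attention: forms an (N, N) matrix -> O(N^2 d)."""
        S = Q @ K.T / np.sqrt(Q.shape[1])
        P = np.exp(S - S.max(axis=1, keepdims=True))
        P /= P.sum(axis=1, keepdims=True)
        return P @ V

    def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1e-6):
        """Kernel trick: compute phi(Q) (phi(K)^T V) instead of
        (phi(Q) phi(K)^T) V, never forming the N x N matrix -> O(N d^2)."""
        Qf, Kf = phi(Q), phi(K)
        KV = Kf.T @ V                   # (d, d) summary of keys/values
        Z = Qf @ Kf.sum(axis=0)         # (N,) normalizer
        return (Qf @ KV) / Z[:, None]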

Structure-aware transformer for graph representation learning

D Chen, L O'Bray, K Borgwardt - … Conference on Machine …, 2022 - proceedings.mlr.press
The Transformer architecture has gained growing attention in graph representation learning
recently, as it naturally overcomes several limitations of graph neural networks (GNNs) by …
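
As a rough picture of the mechanism: queries and keys are computed from a representation of each node's local subgraph rather than from the raw node feature, so attention weights reflect structure. A toy sketch with 1-hop mean aggregation standing in for the paper's subgraph encoder (all names illustrative):

    import numpy as np

    def structure_aware_attention(X, adj, W_q, W_k, W_v):
        """Toy structure-aware self-attention over graph nodes.
        X: (N, d) node features; adj: (N, N) 0/1 adjacency; W_*: (d, d).
        Queries/keys come from a neighborhood summary, so two nodes
        attend based on local structure, not just their own features."""
        deg = adj.sum(axis=1, keepdims=True) + 1.0
        sub = (X + adj @ X) / deg          # node + mean of its neighbors
        Q, K, V = sub @ W_q, sub @ W_k, X @ W_v
        S = Q @ K.T / np.sqrt(Q.shape[1])
        P = np.exp(S - S.max(axis=1, keepdims=True))
        P /= P.sum(axis=1, keepdims=True)
        return P @ V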

Spike-driven transformer

M Yao, J Hu, Z Zhou, L Yuan, Y Tian… - Advances in Neural …, 2024 - proceedings.neurips.cc
Spiking Neural Networks (SNNs) provide an energy-efficient deep learning option
due to their unique spike-based, event-driven (i.e., spike-driven) paradigm. In this paper, we …
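
The energy argument rests on activations being binary spikes: a dot product between 0/1 vectors needs no multiplications, only conditional additions. A toy illustration of that degenerate case (not the paper's exact attention operator):

    import numpy as np

    def spike_driven_scores(Q_spikes, K_spikes):
        """With binary (0/1) spike tensors, Q @ K.T reduces to counting
        positions where both neurons fired -- addition only, no multiplies.
        Q_spikes, K_spikes: (N, d) arrays of 0s and 1s."""
        N = Q_spikes.shape[0]
        scores = np.zeros((N, N), dtype=int)
        for i in range(N):
            mask = Q_spikes[i].astype(bool)            # where query i spiked
            scores[i] = K_spikes[:, mask].sum(axis=1)  # masked additions
        return scores                                  # == Q_spikes @ K_spikes.T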

Spikformer: When spiking neural network meets transformer

Z Zhou, Y Zhu, C He, Y Wang, S Yan, Y Tian… - arXiv preprint arXiv …, 2022 - arxiv.org
We consider two biologically plausible structures, the Spiking Neural Network (SNN) and the
self-attention mechanism. The former offers an energy-efficient and event-driven paradigm …
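
For reference, the event-driven half of that pairing: a leaky integrate-and-fire (LIF) neuron turns real-valued currents into the binary spike trains such a model feeds to self-attention. A minimal sketch; the constants are illustrative, not the paper's settings:

    import numpy as np

    def lif_neuron(currents, tau=2.0, v_threshold=1.0, v_reset=0.0):
        """Leaky integrate-and-fire: the membrane potential leaks toward
        rest, integrates input current, and emits a binary spike (then
        resets) whenever it crosses the threshold.
        currents: (T,) input per time step -> (T,) 0/1 spike train."""
        v = v_reset
        spikes = np.zeros_like(currents)
        for t, i_t in enumerate(currents):
            v = v + (i_t - (v - v_reset)) / tau   # leaky integration
            if v >= v_threshold:
                spikes[t] = 1.0                   # fire...
                v = v_reset                       # ...and reset
        return spikes

    print(lif_neuron(np.array([2.0, 2.0, 0.1, 2.5, 0.0])))  # [1. 1. 0. 1. 0.]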

Hierarchically gated recurrent neural network for sequence modeling

Z Qin, S Yang, Y Zhong - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Transformers have surpassed RNNs in popularity due to their superior abilities in parallel
training and long-term dependency modeling. Recently, there has been a renewed interest …
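
The renewed interest is largely in element-wise linear recurrences, which keep an RNN's state yet admit parallel training because the recurrence is linear in h. A minimal data-gated recurrence of the kind HGRN builds on (its forget-gate lower bounds and layer hierarchy are omitted here):

    import numpy as np

    def gated_linear_recurrence(x, W_f, W_i):
        """h_t = f_t * h_{t-1} + (1 - f_t) * i_t, all element-wise.
        x: (T, d); W_f, W_i: (d, d). Gates depend only on the input,
        not on h_{t-1} through a nonlinearity, which is what makes the
        recurrence parallelizable with a scan at training time."""
        sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
        h = np.zeros(x.shape[1])
        out = np.empty_like(x)
        for t in range(x.shape[0]):
            f_t = sigmoid(x[t] @ W_f)     # element-wise forget gate
            i_t = np.tanh(x[t] @ W_i)     # candidate input
            h = f_t * h + (1.0 - f_t) * i_t
            out[t] = h
        return out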

Mb-taylorformer: Multi-branch efficient transformer expanded by taylor formula for image dehazing

Y Qiu, K Zhang, C Wang, W Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
In recent years, Transformer networks have begun to replace pure convolutional neural
networks (CNNs) in the field of computer vision due to their global receptive field and …
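
The "Taylor formula" of the title refers to linearizing attention: expanding exp(q·k) ≈ 1 + q·k to first order removes the softmax's pairwise coupling, so the product can be reordered into linear complexity. A schematic first-order version, without the paper's multi-branch design or higher-order corrections:

    import numpy as np

    def taylor_linear_attention(Q, K, V):
        """First-order Taylor attention: exp(q.k) ~= 1 + q.k.
        The numerator becomes sum_j (1 + q.k_j) v_j = sum_j v_j + q @ (K^T V),
        i.e. O(N d^2) instead of O(N^2 d). Q, K, V: (N, d); rows of Q, K
        are normalized so 1 + q.k stays non-negative."""
        N = Q.shape[0]
        Qn = Q / np.linalg.norm(Q, axis=1, keepdims=True)
        Kn = K / np.linalg.norm(K, axis=1, keepdims=True)
        num = V.sum(axis=0) + Qn @ (Kn.T @ V)   # (N, d)
        den = N + Qn @ Kn.sum(axis=0)           # (N,) normalizer
        return num / den[:, None]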

Long range language modeling via gated state spaces

H Mehta, A Gupta, A Cutkosky, B Neyshabur - arXiv preprint arXiv …, 2022 - arxiv.org
State space models have been shown to be effective at modeling long-range dependencies,
especially on sequence classification tasks. In this work, we focus on autoregressive …
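
As a rough picture of the "gated" part: the state space layer's output is modulated by a multiplicative gate computed from the input, in the spirit of gated attention units. A simplified per-channel sketch; the dimension-reduction tricks and the actual state space kernel of the paper are omitted:

    import numpy as np

    def gated_state_space(x, a, W_gate):
        """Toy gated SSM block: out_t = gate(x_t) * ssm(x)_t.
        x: (T, d); a: scalar decay in (0, 1) for a one-state-per-channel
        SSM; W_gate: (d, d). The recurrence is cheap and linear in T;
        the sigmoid gate decides per channel how much to let through."""
        h = np.zeros(x.shape[1])
        out = np.empty_like(x)
        for t in range(x.shape[0]):
            h = a * h + (1.0 - a) * x[t]                     # SSM state
            gate = 1.0 / (1.0 + np.exp(-(x[t] @ W_gate)))    # input gate
            out[t] = gate * h
        return out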