GSVA: Generalized segmentation via multimodal large language models
Abstract Generalized Referring Expression Segmentation (GRES) extends the scope of
classic RES to refer to multiple objects in one expression or identify the empty targets absent …
Efficient diffusion transformer with step-wise dynamic attention mediators
This paper identifies significant redundancy in the query-key interactions within self-attention
mechanisms of diffusion transformer models, particularly during the early stages of …
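The mediator idea routes the quadratic query-key interaction through a small set of intermediate tokens. A minimal PyTorch sketch of that general pattern, assuming M learned mediator tokens with M << N; the paper's step-wise scheduling across diffusion timesteps is omitted, and all names here are illustrative:

```python
import torch
import torch.nn.functional as F

def mediator_attention(q, k, v, mediators):
    """Two-stage attention through a compact mediator set.

    q, k, v:   (B, N, D) queries/keys/values over N tokens.
    mediators: (B, M, D) mediator tokens, M << N.
    Cost drops from O(N^2 D) for full attention to O(N M D).
    """
    scale = q.size(-1) ** -0.5
    # Stage 1: mediators gather information from all keys/values.
    attn_mk = F.softmax(mediators @ k.transpose(-2, -1) * scale, dim=-1)  # (B, M, N)
    summary = attn_mk @ v                                                 # (B, M, D)
    # Stage 2: each query reads from the compact mediator summary.
    attn_qm = F.softmax(q @ mediators.transpose(-2, -1) * scale, dim=-1)  # (B, N, M)
    return attn_qm @ summary                                              # (B, N, D)

B, N, M, D = 2, 256, 16, 64
q, k, v = (torch.randn(B, N, D) for _ in range(3))
med = torch.randn(B, M, D)
print(mediator_attention(q, k, v, med).shape)  # torch.Size([2, 256, 64])
```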
Mosaic: in-memory computing and routing for small-world spike-based neuromorphic systems
The brain's connectivity is locally dense and globally sparse, forming a small-world graph—
a principle prevalent in the evolution of various species, suggesting a universal solution for …
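The "locally dense, globally sparse" structure the abstract refers to is easy to reproduce with a Watts-Strogatz graph: a ring lattice gives the dense local clustering, and a few random rewires add short global paths. A quick networkx illustration of the graph family only, not of the Mosaic hardware:

```python
import networkx as nx

# Ring lattice (each node wired to its 6 nearest neighbors), with 10% of
# edges randomly rewired to create long-range shortcuts.
g = nx.connected_watts_strogatz_graph(n=100, k=6, p=0.1)

# Small-world signature: high clustering plus short average path length.
print(f"avg clustering:  {nx.average_clustering(g):.3f}")
print(f"avg path length: {nx.average_shortest_path_length(g):.3f}")
```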
CT-Net: Asymmetric compound branch transformer for medical image segmentation
N Zhang, L Yu, D Zhang, W Wu, S Tian, X Kang, M Li - Neural Networks, 2024 - Elsevier
The Transformer architecture has been widely applied in the field of image segmentation
due to its powerful ability to capture long-range dependencies. However, its ability to capture …
LookupViT: Compressing visual information to a limited number of tokens
Abstract Vision Transformers (ViT) have emerged as the de-facto choice for numerous
industry-grade vision solutions. But their inference cost can be prohibitive for many settings …
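Compressing N patch tokens into a fixed, much smaller budget is commonly done with cross-attention from a set of learned queries, so the expensive per-layer compute runs on the compressed set. A hedged sketch of that pattern (module and parameter names are made up; LookupViT's actual block design differs in detail):

```python
import torch
import torch.nn as nn

class TokenCompressor(nn.Module):
    """Compress N patch tokens into K << N tokens via cross-attention."""

    def __init__(self, dim: int, num_compressed: int = 16, num_heads: int = 4):
        super().__init__()
        # One learned query per compressed token.
        self.queries = nn.Parameter(torch.randn(num_compressed, dim) * 0.02)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, patches: torch.Tensor) -> torch.Tensor:
        # patches: (B, N, dim) -> (B, K, dim)
        q = self.queries.unsqueeze(0).expand(patches.size(0), -1, -1)
        out, _ = self.attn(q, patches, patches)
        return out

x = torch.randn(2, 196, 128)          # e.g. a 14x14 patch grid
print(TokenCompressor(128)(x).shape)  # torch.Size([2, 16, 128])
```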
DAT++: Spatially dynamic vision transformer with deformable attention
Transformers have shown superior performance on various vision tasks. Their large
receptive field endows Transformer models with higher representation power than their CNN …
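Deformable attention's core move is sampling keys and values at data-dependent locations rather than on a fixed grid. A minimal sketch of offset prediction plus bilinear sampling; the offset head here is illustrative and does not mirror DAT++'s exact design:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DeformableSampling(nn.Module):
    """Sample features at learned, input-dependent offsets from a coarse grid."""

    def __init__(self, dim: int, grid: int = 7):
        super().__init__()
        self.grid = grid
        # Light head predicting a 2D offset per reference point.
        self.offset_head = nn.Conv2d(dim, 2, kernel_size=3, padding=1)

    def forward(self, feat: torch.Tensor) -> torch.Tensor:
        # feat: (B, C, H, W) feature map.
        B, C, H, W = feat.shape
        # Uniform reference points on a coarse grid, normalized to [-1, 1].
        ys = torch.linspace(-1, 1, self.grid, device=feat.device)
        xs = torch.linspace(-1, 1, self.grid, device=feat.device)
        ref = torch.stack(torch.meshgrid(ys, xs, indexing="ij"), dim=-1)
        ref = ref.flip(-1)                                  # grid_sample wants (x, y)
        ref = ref.unsqueeze(0).expand(B, -1, -1, -1)        # (B, g, g, 2)
        # Predict bounded offsets at the reference points and shift them.
        coarse = F.adaptive_avg_pool2d(feat, self.grid)
        offsets = self.offset_head(coarse).permute(0, 2, 3, 1).tanh() * (2.0 / self.grid)
        # Bilinearly sample features at the deformed locations; these would
        # serve as the keys/values for the subsequent attention.
        return F.grid_sample(feat, ref + offsets, align_corners=True)  # (B, C, g, g)

f = torch.randn(2, 64, 28, 28)
print(DeformableSampling(64)(f).shape)  # torch.Size([2, 64, 7, 7])
```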
Efficient Vision Transformers with Partial Attention
As a core of Vision Transformer (ViT), self-attention has high versatility in modeling long-
range spatial interactions because every query attends to all spatial locations. Although ViT …
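Partial attention restricts each query to a subset of keys instead of all N locations. An illustrative top-k variant of the idea (not necessarily the paper's selection rule); the full score matrix is computed here only for clarity, whereas efficient implementations avoid materializing it:

```python
import torch
import torch.nn.functional as F

def partial_attention(q, k, v, keep: int):
    """Each query attends only to its `keep` highest-scoring keys."""
    scale = q.size(-1) ** -0.5
    scores = q @ k.transpose(-2, -1) * scale           # (B, N, N)
    # Per-query threshold: the smallest score among its top-`keep` keys.
    thresh = scores.topk(keep, dim=-1).values[..., -1:]
    # Mask out everything below the threshold before the softmax.
    scores = scores.masked_fill(scores < thresh, float("-inf"))
    return F.softmax(scores, dim=-1) @ v               # (B, N, D)

B, N, D = 2, 128, 64
q, k, v = (torch.randn(B, N, D) for _ in range(3))
print(partial_attention(q, k, v, keep=16).shape)  # torch.Size([2, 128, 64])
```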
TransXNet: learning both global and local dynamics with a dual dynamic token mixer for visual recognition
Recent studies have integrated convolution into transformers to introduce inductive bias and
improve generalization performance. However, the static nature of conventional convolution …
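A dual token mixer pairs a global branch (self-attention) with a local branch whose convolution kernels are generated from the input, addressing exactly the static-convolution limitation the snippet mentions. A hedged sketch of that attention-plus-dynamic-depthwise pattern; TransXNet's actual mixer is more elaborate, and all names here are made up:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualTokenMixer(nn.Module):
    """Self-attention on half the channels, dynamic depthwise conv on the rest."""

    def __init__(self, dim: int, num_heads: int = 4, ksize: int = 3):
        super().__init__()
        assert dim % 2 == 0
        half = dim // 2
        self.ksize = ksize
        self.attn = nn.MultiheadAttention(half, num_heads, batch_first=True)
        # Generates one depthwise kernel per channel from a global descriptor.
        self.kernel_gen = nn.Linear(half, half * ksize * ksize)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, C, H, W)
        B, C, H, W = x.shape
        xa, xc = x.chunk(2, dim=1)
        # Global branch: plain self-attention over flattened tokens.
        t = xa.flatten(2).transpose(1, 2)               # (B, HW, C/2)
        ga, _ = self.attn(t, t, t)
        ga = ga.transpose(1, 2).reshape(B, C // 2, H, W)
        # Local branch: depthwise conv with input-conditioned kernels,
        # applied via the grouped-conv batching trick.
        desc = xc.mean(dim=(2, 3))                      # (B, C/2)
        kern = self.kernel_gen(desc).reshape(B * (C // 2), 1, self.ksize, self.ksize)
        lc = F.conv2d(xc.reshape(1, B * (C // 2), H, W), kern,
                      padding=self.ksize // 2, groups=B * (C // 2))
        lc = lc.reshape(B, C // 2, H, W)
        return torch.cat([ga, lc], dim=1)

x = torch.randn(2, 64, 14, 14)
print(DualTokenMixer(64)(x).shape)  # torch.Size([2, 64, 14, 14])
```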
ViT-MVT: A Unified Vision Transformer Network for Multiple Vision Tasks
T Xie, K Dai, Z Jiang, R Li, S Mao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this work, we seek to learn multiple mainstream vision tasks concurrently using a unified
network, which is storage-efficient as numerous networks with task-shared parameters can …
MG-ViT: a multi-granularity method for compact and efficient vision transformers
Abstract Vision Transformer (ViT) faces obstacles in wide application due to its huge
computational cost. Almost all existing studies on compressing ViT adopt the manner of …