Google Tudós

M Awais, M Naseer, S Khan, RM Anwer… - … on Pattern Analysis …, 2025 - ieeexplore.ieee.org

Vision systems that see and reason about the compositional nature of visual scenes are
fundamental to understanding our world. The complex relations between objects and their …

Mentés Hivatkozás Idézetek száma: 135 Kapcsolódó cikkek Mind a(z) 2 változat

[Free GPT-4]

[PDF] arxiv.org

Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier

The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Mentés Hivatkozás Idézetek száma: 148 Kapcsolódó cikkek Mind a(z) 7 változat

[Free GPT-4]

[PDF] thecvf.com

Biformer: Vision transformer with bi-level routing attention

L Zhu, X Wang, Z Ke, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

As the core building block of vision transformers, attention is a powerful tool to capture long-
range dependency. However, such power comes at a cost: it incurs a huge computation …

Mentés Hivatkozás Idézetek száma: 708 Kapcsolódó cikkek Mind a(z) 10 változat HTML-változat

[Free GPT-4]

[PDF] arxiv.org

Vision mamba: Efficient visual representation learning with bidirectional state space model

L Zhu, B Liao, Q Zhang, X Wang, W Liu… - arxiv preprint arxiv …, 2024 - arxiv.org

Recently the state space models (SSMs) with efficient hardware-aware designs, ie, the
Mamba deep learning model, have shown great potential for long sequence modeling …

Mentés Hivatkozás Idézetek száma: 1016 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]

[PDF] thecvf.com

Internimage: Exploring large-scale vision foundation models with deformable convolutions

W Wang, J Dai, Z Chen, Z Huang, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

Compared to the great progress of large-scale vision transformers (ViTs) in recent years,
large-scale models based on convolutional neural networks (CNNs) are still in an early …

Mentés Hivatkozás Idézetek száma: 798 Kapcsolódó cikkek Mind a(z) 8 változat HTML-változat

[Free GPT-4]

[PDF] thecvf.com

Flatten transformer: Vision transformer using focused linear attention

D Han, X Pan, Y Han, S Song… - Proceedings of the …, 2023 - openaccess.thecvf.com

The quadratic computation complexity of self-attention has been a persistent challenge
when applying Transformer models to vision tasks. Linear attention, on the other hand, offers …

Mentés Hivatkozás Idézetek száma: 181 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]

[PDF] neurips.cc

Point transformer v2: Grouped vector attention and partition-based pooling

X Wu, Y Lao, L Jiang, X Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

As a pioneering work exploring transformer architecture for 3D point cloud understanding,
Point Transformer achieves impressive results on multiple highly competitive benchmarks. In …

Mentés Hivatkozás Idézetek száma: 380 Kapcsolódó cikkek Mind a(z) 8 változat HTML-változat

[Free GPT-4]

[PDF] thecvf.com

Activating more pixels in image super-resolution transformer

X Chen, X Wang, J Zhou, Y Qiao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Transformer-based methods have shown impressive performance in low-level vision tasks,
such as image super-resolution. However, we find that these networks can only utilize a …

Mentés Hivatkozás Idézetek száma: 758 Kapcsolódó cikkek Mind a(z) 9 változat HTML-változat

[Free GPT-4]

[PDF] arxiv.org

Vision transformer adapter for dense predictions

Z Chen, Y Duan, W Wang, J He, T Lu, J Dai… - arxiv preprint arxiv …, 2022 - arxiv.org

This work investigates a simple yet powerful adapter for Vision Transformer (ViT). Unlike
recent visual transformers that introduce vision-specific inductive biases into their …

Mentés Hivatkozás Idézetek száma: 615 Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

[Free GPT-4]

[PDF] thecvf.com

Spherical transformer for lidar-based 3d recognition

X Lai, Y Chen, F Lu, J Liu, J Jia - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

LiDAR-based 3D point cloud recognition has benefited various applications. Without
specially considering the LiDAR point distribution, most current methods suffer from …

Mentés Hivatkozás Idézetek száma: 151 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Cswin transformer: A general vision transformer backbone with cross-shaped windows

Foundation Models Defining a New Era in Vision: a Survey and Outlook

Advances in medical image analysis with vision transformers: a comprehensive review

Biformer: Vision transformer with bi-level routing attention

Vision mamba: Efficient visual representation learning with bidirectional state space model

Internimage: Exploring large-scale vision foundation models with deformable convolutions

Flatten transformer: Vision transformer using focused linear attention

Point transformer v2: Grouped vector attention and partition-based pooling

Activating more pixels in image super-resolution transformer

Vision transformer adapter for dense predictions

Spherical transformer for lidar-based 3d recognition