A comprehensive survey of transformers for computer vision
Vision transformers (ViTs), a special type of transformer, can be used for various computer
vision (CV) applications. Convolutional neural networks (CNNs) have several potential …
Flatten transformer: Vision transformer using focused linear attention
The quadratic computational complexity of self-attention has been a persistent challenge
when applying Transformer models to vision tasks. Linear attention, on the other hand, offers …
Measuring and narrowing the compositionality gap in language models
We investigate the ability of language models to perform compositional reasoning tasks
where the overall solution depends on correctly composing the answers to sub-problems …
Vision transformer with deformable attention
Transformers have recently shown superior performance on various vision tasks. The large,
sometimes even global, receptive field endows Transformer models with higher …
A survey of visual transformers
Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …
Dynamic neural networks: A survey
Dynamic neural networks are an emerging research topic in deep learning. Compared to static
models, which have fixed computational graphs and parameters at the inference stage …
Adaptive rotated convolution for rotated object detection
Rotated object detection aims to identify and locate objects in images with arbitrary
orientation. In this scenario, the oriented directions of objects vary considerably across …
Not all patches are what you need: Expediting vision transformers via token reorganizations
Vision Transformers (ViTs) take all the image patches as tokens and construct multi-head
self-attention (MHSA) among them. Complete leverage of these image tokens brings …
FlexiViT: One model for all patch sizes
Vision Transformers convert images to sequences by slicing them into patches. The size of
these patches controls a speed/accuracy tradeoff, with smaller patches leading to higher …
AdaViT: Adaptive vision transformers for efficient image recognition
Built on top of self-attention mechanisms, vision transformers have demonstrated
remarkable performance on a variety of vision tasks recently. While achieving excellent …