- Academic Search

A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer

Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

Cited by 110 · All 11 versions

P2T: Pyramid pooling transformer for scene understanding

YH Wu, Y Liu, X Zhan… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Recently, the vision transformer has achieved great success by pushing the state-of-the-art
of various vision tasks. One of the most challenging problems in the vision transformer is that …

Cited by 265 · All 15 versions

Robust principles: Architectural design principles for adversarially robust cnns

SY Peng, W Xu, C Cornelius, M Hull, K Li… - ar…

… methods and systems that can automatically detect and recognize falls, particularly among the elderly and …

Cited by 1 · All 3 versions

Cascast: Skillful high-resolution precipitation nowcasting via cascaded modelling

J Gong, L Bai, P Ye, W Xu, N Liu, J Dai, X Yang… - arxiv preprint arxiv …, 2024 - arxiv.org

Precipitation nowcasting based on radar data plays a crucial role in extreme weather
prediction and has broad implications for disaster management. Despite progresses have …

Cited by 16 · All 6 versions

6d-vit: Category-level 6d object pose estimation via transformer-based instance representation learning

L Zou, Z Huang, N Gu, G Wang - IEEE Transactions on Image …, 2022 - ieeexplore.ieee.org

This paper presents 6D vision transformer (6D-ViT), a transformer-based instance
representation learning network suitable for highly accurate category-level object pose …

Cited by 43 · All 5 versions

Transformer-based multi-attention hybrid networks for skin lesion segmentation

Z Dong, J Li, Z Hua - Expert Systems with Applications, 2024 - Elsevier

High-precision segmentation of skin lesions is essential for early diagnosis of skin cancer
and improved patient survival. However, this task becomes challenging due to the …

Cited by 12 · All 2 versions

Ringmo-lite: A remote sensing lightweight network with cnn-transformer hybrid framework

Y Wang, T Zhang, L Zhao, L Hu, Z Wang… - … on Geoscience and …, 2024 - ieeexplore.ieee.org

In recent years, remote sensing (RS) vision foundation models, such as RingMo, have
emerged and achieved excellent performance in various downstream tasks. However, the …

Cited by 13 · All 3 versions

STB-VMM: Swin transformer based video motion magnification

R Lado-Roigé, MA Pérez - Knowledge-Based Systems, 2023 - Elsevier

The goal of video motion magnification techniques is to magnify small motions in a video to
reveal previously invisible or unseen movement. Its uses extend from bio-medical …

Cited by 15 · All 7 versions

Utilizing adaptive deformable convolution and position embedding for colon polyp segmentation with a visual transformer

MY Sikkandar, SG Sundaram, A Alassaf… - Scientific Reports, 2024 - nature.com

Polyp detection is a challenging task in the diagnosis of Colorectal Cancer (CRC), and it
demands clinical expertise due to the diverse nature of polyps. The recent years have …

Cited by 4 · All 7 versions
