Pruning self-attentions into convolutional layers in single path

H He, J Cai, J Liu, Z Pan, J Zhang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision Transformers (ViTs) have achieved impressive performance over various computer
vision tasks. However, modeling global correlations with multi-head self-attention (MSA) …

Towards flexible inductive bias via progressive reparameterization scheduling

Y Lee, G Lee, K Ryoo, H Go, J Park, S Kim - European Conference on …, 2022 - Springer
There are two de facto standard architectures in recent computer vision: Convolutional
Neural Networks (CNNs) and Vision Transformers (ViTs). Strong inductive biases of …

Overparametrization, architecture and dynamics of deep neural networks: from theory to practice

S d'Ascoli - 2022 - theses.hal.science
Deep learning has become the cornerstone of artificial intelligence, and has fueled
breakthroughs in a number of fields. Yet, the key reasons underpinning the success of deep …

CSA-BERT: Video Question Answering

K Jenni, M Srinivas, R Sannapu… - 2023 IEEE Statistical …, 2023 - ieeexplore.ieee.org
Convolutional networks are a key component of many computer vision applications.
However, convolutions have a serious flaw: they only operate over a small local area, hence they lack …

Towards Efficient Training and Inference of Large Transformer Models

H He - 2024 - bridges.monash.edu
Transformers have revolutionized modern applications but are costly as model sizes grow.
This thesis targets efficient training and inference of large Transformer models. We first …

Adaptive Attention Link-based Regularization for Vision Transformers

H Jin, J Choi - arXiv preprint arXiv:2211.13852, 2022 - arxiv.org
Although transformer networks have recently been employed in various vision tasks with
strong performance, extensive training data and a lengthy training time are required …

[PDF] Tackling the 2021 Algonauts Challenge with Semi-Supervised Networks & Bayesian Optimization

RT Lange - algonauts.csail.mit.edu
Deep neural networks have been widely adopted as state-of-the-art models of the visual
ventral stream (e.g. Cadieu et al., 2014; Yamins and DiCarlo, 2016; Cichy et al., 2016). Most …