ChatGPT and Open-AI Models: A Preliminary Review

KI Roumeliotis, ND Tselikas - Future Internet, 2023 - mdpi.com
According to numerous reports, ChatGPT represents a significant breakthrough in the field of
artificial intelligence. ChatGPT is a pre-trained AI model designed to engage in natural …

Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

EfficientViT: Memory efficient vision transformer with cascaded group attention

X Liu, H Peng, N Zheng, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Vision transformers have shown great success due to their high model capabilities.
However, their remarkable performance is accompanied by heavy computation costs, which …

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

J Chen, J Yu, C Ge, L Yao, E Xie, Y Wu, Z Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
The most advanced text-to-image (T2I) models require significant training costs (eg, millions
of GPU hours), seriously hindering the fundamental innovation for the AIGC community …

Flatten transformer: Vision transformer using focused linear attention

D Han, X Pan, Y Han, S Song… - Proceedings of the …, 2023 - openaccess.thecvf.com
The quadratic computation complexity of self-attention has been a persistent challenge
when applying Transformer models to vision tasks. Linear attention, on the other hand, offers …

Hyena hierarchy: Towards larger convolutional language models

M Poli, S Massaroli, E Nguyen, DY Fu… - International …, 2023 - proceedings.mlr.press
Recent advances in deep learning have relied heavily on the use of large Transformers due
to their ability to learn at scale. However, the core building block of Transformers, the …

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

J Chen, C Ge, E **e, Y Wu, L Yao, X Ren… - … on Computer Vision, 2024 - Springer
In this paper, we introduce PixArt-Σ, a Diffusion Transformer model (DiT) capable of directly
generating images at 4K resolution. PixArt-Σ represents a significant advancement over its …

PromptIR: Prompting for all-in-one image restoration

V Potlapalli, SW Zamir, SH Khan… - Advances in Neural …, 2023 - proceedings.neurips.cc
Image restoration involves recovering a high-quality clean image from its degraded version.
Deep learning-based methods have significantly improved image restoration performance …

InceptionNeXt: When Inception meets ConvNeXt

W Yu, P Zhou, S Yan, X Wang - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Inspired by the long-range modeling ability of ViTs, large-kernel convolutions are widely
studied and adopted recently to enlarge the receptive field and improve model performance …

SRFormer: Permuted self-attention for single image super-resolution

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …