Google Scholar

A comprehensive survey on applications of transformers for deep learning tasks

S Islam, H Elmekki, A Elsebai, J Bentahar… - Expert Systems with …, 2024 - Elsevier

Abstract Transformers are Deep Neural Networks (DNN) that utilize a self-attention
mechanism to capture contextual relationships within sequential data. Unlike traditional …

Cited by 212

Deep multimodal data fusion

F Zhao, C Zhang, B Geng - ACM computing surveys, 2024 - dl.acm.org

Multimodal Artificial Intelligence (Multimodal AI), in general, involves various types of data
(e.g., images, texts, or data collected from different sensors), feature engineering (e.g. …

Cited by 41

Multiscale vision transformers

H Fan, B Xiong, K Mangalam, Y Li… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract We present Multiscale Vision Transformers (MViT) for video and image recognition,
by connecting the seminal idea of multiscale feature hierarchies with transformer models …

Cited by 1573

Conceptual 12m: Pushing web-scale image-text pre-training to recognize long-tail visual concepts

S Changpinyo, P Sharma, N Ding… - Proceedings of the …, 2021 - openaccess.thecvf.com

The availability of large-scale image captioning and visual question answering datasets has
contributed significantly to recent successes in vision-and-language pre-training. However …

Cited by 1107

Task-adaptive attention for image captioning

C Yan, Y Hao, L Li, J Yin, A Liu, Z Mao… - … on Circuits and …, 2021 - ieeexplore.ieee.org

Attention mechanisms are now widely used in image captioning models. However, most
attention models only focus on visual features. When generating syntax related words, little …

Cited by 284

Image encryption algorithm based on a 2D-CLSS hyperchaotic map using simultaneous permutation and diffusion

L Teng, X Wang, Y Xian - Information Sciences, 2022 - Elsevier

A two-dimensional cross-mode hyperchaotic map based on logistic and sine maps (2D-CLSS)
is presented. The hyperchaotic map consists of a logistic map and two sine maps …

Cited by 150

Fine-grained visual classification via internal ensemble learning transformer

Q Xu, J Wang, B Jiang, B Luo - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Recently, vision transformers (ViTs) have been investigated in fine-grained visual
recognition (FGVC) and are now considered state of the art. However, most ViT-based works …

Cited by 86

An edge traffic flow detection scheme based on deep learning in an intelligent transportation system

C Chen, B Liu, S Wan, P Qiao… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

An intelligent transportation system (ITS) plays an important role in public transport
management, security and other issues. Traffic flow detection is an important part of the ITS …

Cited by 388

Deep CNN for brain tumor classification

W Ayadi, W Elhamzi, I Charfi, M Atri - Neural processing letters, 2021 - Springer

Brain tumor represents one of the most fatal cancers around the world. It is common cancer
in adults and children. It has the lowest survival rate and various types depending on their …

Cited by 322

Bagfn: broad attentive graph fusion network for high-order feature interactions

Z Xie, W Zhang, B Sheng, P Li… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Modeling feature interactions is of crucial significance to high-quality feature engineering on
multifield sparse data. At present, a series of state-of-the-art methods extract cross features …

Cited by 166

Multimodal transformer with multi-view visual representation for image captioning

A comprehensive survey on applications of transformers for deep learning tasks