Академия Google

Destnet: Densely fused spatial transformer networks

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

M-FFN: multi-scale feature fusion network for image captioning

J Prudviraj, C Vishnu, CK Mohan - Applied Intelligence, 2022 - Springer

In this work, we present a novel multi-scale feature fusion network (M-FFN) for image
captioning task to incorporate discriminative features and scene contextual information of an …

Сохранить Цитировать Цитируется: 65 Похожие статьи Все версии статьи (4)

Attentive contextual network for image captioning

J Prudviraj, C Vishnu, CK Mohan - 2021 International Joint …, 2021 - ieeexplore.ieee.org

Existing image captioning approaches fail to generate fine-grained captions due to the lack
of rich encoding representation of an image. In this paper, we present an attentive contextual …

Сохранить Цитировать Цитируется: 44 Похожие статьи Все версии статьи (2)

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Explicit disentanglement of appearance and perspective in generative models

N Skafte, S Hauberg - Advances in Neural Information …, 2019 - proceedings.neurips.cc

Disentangled representation learning finds compact, independent and easy-to-interpret
factors of the data. Learning such has been shown to require an inductive bias, which we …

Сохранить Цитировать Цитируется: 53 Похожие статьи Все версии статьи (11) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Jointly aligning millions of images with deep penalised reconstruction congealing

R Annunziata, C Sagonas… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Extrapolating fine-grained pixel-level correspondences in a fully unsupervised manner from
a large set of misaligned images can benefit several computer vision and graphics …

Сохранить Цитировать Цитируется: 11 Похожие статьи Все версии статьи (5) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] sciendo.com

[PDF][PDF] Feature map augmentation to improve scale invariance in convolutional neural networks

D Kumar, D Sharma - Journal of Artificial Intelligence and Soft …, 2023 - sciendo.com

Introducing variation in the training dataset through data augmentation has been a popular
technique to make Convolutional Neural Networks (CNNs) spatially invariant but leads to …

Сохранить Цитировать Цитируется: 4 Похожие статьи Все версии статьи (8) В виде HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Adjoint rigid transform network: Task-conditioned alignment of 3d shapes

K Zhou, BL Bhatnagar, B Schiele… - … conference on 3D …, 2022 - ieeexplore.ieee.org

Most learning methods for 3D data suffer significant performance drops when the data is not
carefully aligned to a canonical orientation. Aligning real world 3D data collected from …

Сохранить Цитировать Цитируется: 5 Похожие статьи Все версии статьи (7)

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Cot-DCN-YOLO: Self-attention-enhancing YOLOv8s for detecting garbage bins in urban street view images

S Dong, W Xu, H Zhang, L Gong - The Egyptian Journal of Remote Sensing …, 2025 - Elsevier

Accurately and quickly obtaining information from garbage bins has great application value
in smart city construction and urban environmental management. However, existing deep …

Сохранить Цитировать Похожие статьи Все версии статьи (6)

[Free GPT-4]
[DeepSeek]

[PDF] canberra.edu.au

[PDF][PDF] Multi-modal information extraction and fusion with convolutional neural networks for classification of scaled images

D Kumar - 2020 - researchprofiles.canberra.edu.au

Develo** computational algorithms to model the biological vision system has challenged
researchers in the computer vision field for several decades. As a result, state-of-the-art …

Сохранить Цитировать Цитируется: 2 Похожие статьи Поиск в библиотеках

Создать оповещение

Цитировать

Расширенный поиск

Сохранено в вашей библиотеке

Destnet: Densely fused spatial transformer networks

M-FFN: multi-scale feature fusion network for image captioning

Attentive contextual network for image captioning

Explicit disentanglement of appearance and perspective in generative models

Jointly aligning millions of images with deep penalised reconstruction congealing

[PDF][PDF] Feature map augmentation to improve scale invariance in convolutional neural networks

Adjoint rigid transform network: Task-conditioned alignment of 3d shapes

[HTML][HTML] Cot-DCN-YOLO: Self-attention-enhancing YOLOv8s for detecting garbage bins in urban street view images

[PDF][PDF] Multi-modal information extraction and fusion with convolutional neural networks for classification of scaled images