- Academic Search

M Stefanini, M Cornia, L Baraldi… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Connecting Vision and Language plays an essential role in Generative Intelligence. For this
reason, large research efforts have been devoted to image captioning, ie describing images …

Speichern Zitieren Zitiert von: 396 Ähnliche Artikel Alle 11 Versionen

[Free GPT-4]

[PDF] arxiv.org

Deep learning approaches on image captioning: A review

T Ghandi, H Pourreza, H Mahyar - ACM Computing Surveys, 2023 - dl.acm.org

Image captioning is a research area of immense importance, aiming to generate natural
language descriptions for visual content in the form of still images. The advent of deep …

Speichern Zitieren Zitiert von: 102 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] arxiv.org

Skeleton-based action recognition via spatial and temporal transformer networks

C Plizzari, M Cannici, M Matteucci - Computer Vision and Image …, 2021 - Elsevier

Abstract Skeleton-based Human Activity Recognition has achieved great interest in recent
years as skeleton data has demonstrated being robust to illumination changes, body scales …

Speichern Zitieren Zitiert von: 350 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]

[PDF] thecvf.com

Spatial-temporal transformer for dynamic scene graph generation

Y Cong, W Liao, H Ackermann… - Proceedings of the …, 2021 - openaccess.thecvf.com

Dynamic scene graph generation aims at generating a scene graph of the given video.
Compared to the task of scene graph generation from images, it is more challenging …

Speichern Zitieren Zitiert von: 153 Ähnliche Artikel Alle 12 Versionen HTML-Version

Why did the AI make that decision? Towards an explainable artificial intelligence (XAI) for autonomous driving systems

J Dong, S Chen, M Miralinaghi, T Chen, P Li… - … research part C …, 2023 - Elsevier

User trust has been identified as a critical issue that is pivotal to the success of autonomous
vehicle (AV) operations where artificial intelligence (AI) is widely adopted. For such …

Speichern Zitieren Zitiert von: 46 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] nature.com

β-Variational autoencoders and transformers for reduced-order modelling of fluid flows

A Solera-Rico, C Sanmiguel Vila… - Nature …, 2024 - nature.com

Variational autoencoder architectures have the potential to develop reduced-order models
for chaotic fluid flows. We propose a method for learning compact and near-orthogonal …

Speichern Zitieren Zitiert von: 54 Ähnliche Artikel Alle 12 Versionen

Automated radiographic report generation purely on transformer: A multicriteria supervised approach

Z Wang, H Han, L Wang, X Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Automated radiographic report generation is challenging in at least two aspects. First,
medical images are very similar to each other and the visual differences of clinic importance …

Speichern Zitieren Zitiert von: 68 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]

[PDF] thecvf.com

Dynamic scene graph generation via anticipatory pre-training

Y Li, X Yang, C Xu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Humans can not only see the collection of objects in visual scenes, but also identify the
relationship between objects. The visual relationship in the scene can be abstracted into the …

Speichern Zitieren Zitiert von: 41 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Dual graph convolutional networks with transformer and curriculum learning for image captioning

X Dong, C Long, W Xu, C **ao - Proceedings of the 29th ACM …, 2021 - dl.acm.org

Existing image captioning methods just focus on understanding the relationship between
objects or instances in a single image, without exploring the contextual correlation existed …

Speichern Zitieren Zitiert von: 69 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]

[PDF] acm.org

Reformer: The relational transformer for image captioning

X Yang, Y Liu, X Wang - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

Image captioning is shown to be able to achieve a better performance by using scene
graphs to represent the relations of objects in the image. The current captioning encoders …

Speichern Zitieren Zitiert von: 65 Ähnliche Artikel Alle 3 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Image captioning through image transformer

From show to tell: A survey on deep learning-based image captioning

Deep learning approaches on image captioning: A review

Skeleton-based action recognition via spatial and temporal transformer networks

Spatial-temporal transformer for dynamic scene graph generation

Why did the AI make that decision? Towards an explainable artificial intelligence (XAI) for autonomous driving systems

β-Variational autoencoders and transformers for reduced-order modelling of fluid flows

Automated radiographic report generation purely on transformer: A multicriteria supervised approach

Dynamic scene graph generation via anticipatory pre-training

Dual graph convolutional networks with transformer and curriculum learning for image captioning

Reformer: The relational transformer for image captioning