Google Academic

Dense semantic embedding network for image captioning

Evolution of visual data captioning Methods, Datasets, and evaluation Metrics: A comprehensive survey

D Sharma, C Dhiman, D Kumar - Expert Systems with Applications, 2023 - Elsevier

Automatic Visual Captioning (AVC) generates syntactically and semantically correct
sentences by describing important objects, attributes, and their relationships with each other …

Cited by 15. All 2 versions.

A survey on advancements in image-text multimodal models: From general techniques to biomedical implementations

R Guo, J Wei, L Sun, B Yu, G Chang, D Liu… - Computers in biology …, 2024 - Elsevier

With the significant advancements of Large Language Models (LLMs) in the field of Natural
Language Processing (NLP), the development of image-text multimodal models has …

Cited by 5. All 5 versions.

[PDF] researchgate.net

Adaptive path selection for dynamic image captioning

T Xian, Z Li, Z Tang, H Ma - … on Circuits and Systems for Video …, 2022 - ieeexplore.ieee.org

Image captioning is a challenging task, i.e., given an image machine automatically generates
natural language that matches its semantic content and has attracted much attention in …

Cited by 55. All 2 versions.

[PDF] arxiv.org

Split, embed and merge: An accurate table structure recognizer

Z Zhang, J Zhang, J Du, F Wang - Pattern Recognition, 2022 - Elsevier

Table structure recognition is an essential part for making machines understand tables. Its
main task is to recognize the internal structure of a table. However, due to the complexity …

Cited by 58. All 6 versions.

[PDF] arxiv.org

Exploring fine-grained image-text alignment for referring remote sensing image segmentation

S Lei, X Xiao, T Zhang, HC Li, Z Shi… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Given a language expression, referring remote sensing image segmentation (RRSIS) aims
to identify ground objects and assign pixelwise labels within the imagery. One of the key …

Cited by 10. All 5 versions.

Transformer-based local-global guidance for image captioning

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Expert Systems with …, 2023 - Elsevier

Image captioning is a difficult problem for machine learning algorithms to compress huge
amounts of images into descriptive languages. The recurrent models are popularly used as …

Cited by 24. All 2 versions.

A multi-layer memory sharing network for video captioning

TZ Niu, SS Dong, ZD Chen, X Luo, Z Huang, S Guo… - Pattern Recognition, 2023 - Elsevier

Over the past several years, video captioning has received much attention in computer
vision and machine learning communities. Many models utilize an RNN-based decoder to …

Cited by 16. All 4 versions.

[PDF] arxiv.org

Protect, show, attend and tell: Empowering image captioning models with ownership protection

JH Lim, CS Chan, KW Ng, L Fan, Q Yang - Pattern Recognition, 2022 - Elsevier

By and large, existing Intellectual Property (IP) protection on deep neural networks typically
i) focus on image classification task only, and ii) follow a standard digital watermarking …

Cited by 44. All 7 versions.

Image captioning using transformer-based double attention network

H Parvin, AR Naghsh-Nilchi, HM Mohammadi - Engineering Applications of …, 2023 - Elsevier

Image captioning generates a human-like description for a query image, which has attracted
considerable attention recently. The most broadly utilized model for image description is an …

Cited by 10. All 2 versions.

Divergent-convergent attention for image captioning

J Ji, Z Du, X Zhang - Pattern Recognition, 2021 - Elsevier

Attention mechanism has made great progress in image captioning, where semantic words
or local regions are selectively embedded into the language model. However, current …

Cited by 33. All 2 versions.
