Smallcap: lightweight image captioning prompted with retrieval augmentation

R Ramos, B Martins, D Elliott… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advances in image captioning have focused on scaling the data and model size,
substantially increasing the cost of pre-training and finetuning. As an alternative to large …

Do you remember? dense video captioning with cross-modal memory retrieval

M Kim, HB Kim, J Moon, J Choi… - Proceedings of the …, 2024 - openaccess.thecvf.com
There has been significant attention to the research on dense video captioning which aims
to automatically localize and caption all events within untrimmed video. Several studies …

Retrieval-augmented image captioning

R Ramos, D Elliott, B Martins - arxiv preprint arxiv:2302.08268, 2023 - arxiv.org
Inspired by retrieval-augmented language generation and pretrained Vision and Language
(V&L) encoders, we present a new approach to image captioning that generates sentences …

TADACap: Time-series Adaptive Domain-Aware Captioning

E Fons, R Kaur, Z Zeng, S Palande, T Balch… - Proceedings of the 5th …, 2024 - dl.acm.org
While image captioning has gained significant attention, the potential of captioning time-
series images, prevalent in areas like finance and healthcare, remains largely untapped …

Towards a sentiment-aware conversational agent

I Dias, R Rei, P Pereira, L Coheur - Proceedings of the 22nd ACM …, 2022 - dl.acm.org
We propose an end-to-end sentiment-aware conversational agent based on two models: a
reply sentiment prediction model and a text generation model, conditioned on the predicted …

[ЦИТАТА][C] Deep Learning for Remote Sensing Image Captioning

JMD Barata - 2021