Positive-Augmented Constrastive Learning for Image and Video Captioning Evaluation S Sarto, M Barraco, M Cornia, L Baraldi, R Cucchiara IEEE/CVF Conference on Computer Vision and Pattern Recognition (Highlight Paper), 2023 | 55 | 2023 |
Retrieval-augmented transformer for image captioning S Sarto, M Cornia, L Baraldi, R Cucchiara International Conference on Content-based Multimedia Indexing, 1-7, 2022 | 52 | 2022 |
The Revolution of Multimodal Large Language Models: A Survey D Caffagni, F Cocchi, L Barsellotti, N Moratelli, S Sarto, L Baraldi, ... Association for Computational Linguistics (Findings), 2024 | 44 | 2024 |
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs D Caffagni, F Cocchi, N Moratelli, S Sarto, M Cornia, L Baraldi, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024 | 27 | 2024 |
With a little help from your own past: Prototypical memory networks for image captioning M Barraco, S Sarto, M Cornia, L Baraldi, R Cucchiara Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 18 | 2023 |
Multi-class unlearning for image classification via weight filtering S Poppi, S Sarto, M Cornia, L Baraldi, R Cucchiara IEEE Intelligent Systems, 2024 | 7* | 2024 |
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues S Sarto, M Cornia, L Baraldi, R Cucchiara European Conference on Computer Vision, 2024 | 6 | 2024 |
Towards Retrieval-Augmented Architectures for Image Captioning S Sarto, M Cornia, L Baraldi, A Nicolosi, R Cucchiara ACM Transactions on Multimedia Computing, Communications and Applications, 2024 | 5 | 2024 |
Video Surveillance and Privacy: A Solvable Paradox? R Cucchiara, L Baraldi, M Cornia, S Sarto Computer 57 (3), 91-100, 2024 | 3 | 2024 |
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training S Sarto, N Moratelli, M Cornia, L Baraldi, R Cucchiara arXiv preprint arXiv:2410.07336, 2024 | 2 | 2024 |
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios V Pipoli, F Bolelli, S Sarto, M Cornia, L Baraldi, C Grana, R Cucchiara, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025 | | 2025 |
Unlearning Vision Transformers Without Retaining Data via Low-Rank Decompositions S Poppi, S Sarto, M Cornia, L Baraldi, R Cucchiara International Conference on Pattern Recognition, 147-163, 2025 | | 2025 |
Transformer combinato con tecniche di retrieval per generazione di didascalie di immagini S SARTO | | 2022 |