Követés
Nicholas Moratelli
Cím
Hivatkozott rá
Hivatkozott rá
Év
The Revolution of Multimodal Large Language Models: A Survey
D Caffagni, F Cocchi, L Barsellotti, N Moratelli, S Sarto, L Baraldi, ...
Findings of the Association for Computational Linguistics (ACL), 2024
432024
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
D Caffagni, F Cocchi, N Moratelli, S Sarto, M Cornia, L Baraldi, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
272024
Fashion-oriented image captioning with external knowledge retrieval and fully attentive gates
N Moratelli, M Barraco, D Morelli, M Cornia, L Baraldi, R Cucchiara
Sensors 23 (3), 1286, 2023
172023
Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization
N Moratelli, D Caffagni, M Cornia, L Baraldi, R Cucchiara
Proceedings of the British Machine Vision Conference 2024 (BMVC Oral), 2024
42024
Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis
D Bucciarelli, N Moratelli, M Cornia, L Baraldi, R Cucchiara
Proceedings of the European Conference on Computer Vision Workshops (ECCVW), 2024
32024
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization
N Moratelli, M Barraco, M Cornia, L Baraldi, R Cucchiara
IEEE Intelligent Systems, 2024
22024
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training
S Sarto, N Moratelli, M Cornia, L Baraldi, R Cucchiara
arXiv preprint arXiv:2410.07336, 2024
12024
Causal Graphical Models for Vision-Language Compositional Understanding
F Parascandolo, N Moratelli, E Sangineto, L Baraldi, R Cucchiara
The Thirteenth International Conference on Learning Representations (ICLR 2025), 2024
2024
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
F Cocchi, N Moratelli, M Cornia, L Baraldi, R Cucchiara
arXiv preprint arXiv:2411.16863, 2024
2024
Fluent and Accurate Image Captioning with a Self-Trained Reward Model
N Moratelli, M Cornia, L Baraldi, R Cucchiara
International Conference on Pattern Recognition (ICPR Oral), 2024
2024
Descrizione di immagini in linguaggio naturale utilizzando un nuovo meccanismo di attenzione e conoscenza esterna
N MORATELLI
2022
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–11