A review of multimodal explainable artificial intelligence: Past, present and future

S Sun, W An, F Tian, F Nan, Q Liu, J Liu, N Shah… - arxiv preprint arxiv …, 2024 - arxiv.org
Artificial intelligence (AI) has rapidly developed through advancements in computational
power and the growth of massive datasets. However, this progress has also heightened …

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Z Qin, D Chen, W Zhang, L Yao, Y Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of large language models (LLMs) has been witnessed in recent
years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from …

Unified Generative and Discriminative Training for Multi-modal Large Language Models

W Chow, J Li, Q Yu, K Pan, H Fei, Z Ge, S Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent times, Vision-Language Models (VLMs) have been trained under two predominant
paradigms. Generative training has enabled Multimodal Large Language Models (MLLMs) …

[PDF][PDF] DCoT: Dual Chain-of-Thought Prompting for Large Multimodal Models

Z Jia, J Liu, H Li, Q Liu, H Gao - The 16th Asian …, 2024 - raw.githubusercontent.com
Inference augmentation techniques such as Chain-of-Thought have already made their
mark in Large Language Models (LLMs). However, transferring these advances to Large …