Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning

X Yu, Z Yang, C Chen - arXiv preprint arXiv:2501.15470, 2025 - arxiv.org
Multimodal Retrieval Augmented Generation (MRAG) systems, while promising for
enhancing Multimodal Large Language Models (MLLMs), often rely on rigid, single-step …