Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension

Y **e, K Yang, N Yang, W Deng, X Dai, T Gu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advances in Large Language Models (LLMs) have catalyzed the development of
Large Multimodal Models (LMMs). However, existing research primarily focuses on tuning …