Explainable and interpretable multimodal large language models: A comprehensive survey
The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with
large language models (LLMs) and computer vision (CV) systems driving advancements in …
large language models (LLMs) and computer vision (CV) systems driving advancements in …
MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images
Existing multi-modal learning methods on fundus and OCT images mostly require both
modalities to be available and strictly paired for training and testing, which appears less …
modalities to be available and strictly paired for training and testing, which appears less …
Large Language Model with Region-guided Referring and Grounding for CT Report Generation
Computed tomography (CT) report generation is crucial to assist radiologists in interpreting
CT volumes, which can be time-consuming and labor-intensive. Existing methods primarily …
CT volumes, which can be time-consuming and labor-intensive. Existing methods primarily …