Google Scholar

Y Dang, K Huang, J Huo, Y Yan, S Huang, D Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid development of Artificial Intelligence (AI) has revolutionized numerous fields, with
large language models (LLMs) and computer vision (CV) systems driving advancements in …

Cited by 8

MultiEYE: Dataset and Benchmark for OCT-Enhanced Retinal Disease Recognition from Fundus Images

L Wang, C Qi, C Ou, L An, M Jin… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Existing multi-modal learning methods on fundus and OCT images mostly require both
modalities to be available and strictly paired for training and testing, which appears less …

Large Language Model with Region-guided Referring and Grounding for CT Report Generation

Z Chen, Y Bie, H Jin, H Chen - arXiv preprint arXiv:2411.15539, 2024 - arxiv.org

Computed tomography (CT) report generation is crucial to assist radiologists in interpreting
CT volumes, which can be time-consuming and labor-intensive. Existing methods primarily …

Interpretable bilingual multimodal large language model for diverse biomedical tasks

Explainable and interpretable multimodal large language models: A comprehensive survey