Medtrinity-25m: A large-scale multimodal dataset with multigranular annotations for medicine

Y ** Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography
IE Hamamci, S Er, F Almas, AG Simsek, SN Esirgun… - 2024 - researchsquare.com
While computer vision has achieved tremendous success with multimodal encoding and
direct textual interaction with images via chat-based large language models, similar …

Read Like a Radiologist: Efficient Vision-Language Model for 3D Medical Imaging Interpretation

C Lee, S Park, CI Shin, WH Choi, HJ Park… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent medical vision-language models (VLMs) have shown promise in 2D medical image
interpretation. However extending them to 3D medical imaging has been challenging due to …