Vision-language models for medical report generation and visual question answering: A review
Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …
Rocov2: Radiology objects in context version 2, an updated multimodal image dataset
Automated medical image analysis systems often require large amounts of training data with
high quality labels, which are difficult and time consuming to generate. This paper …
high quality labels, which are difficult and time consuming to generate. This paper …
Overview of the ImageCLEF 2024: Multimedia retrieval in medical applications
This paper presents an overview of the ImageCLEF 2024 lab, organized as part of the
Conference and Labs of the Evaluation Forum–CLEF Labs 2024. ImageCLEF, an ongoing …
Conference and Labs of the Evaluation Forum–CLEF Labs 2024. ImageCLEF, an ongoing …
Masked vision and language pre-training with unimodal and multimodal contrastive losses for medical visual question answering
Medical visual question answering (VQA) is a challenging task that requires answering
clinical questions of a given medical image, by taking consider of both visual and language …
clinical questions of a given medical image, by taking consider of both visual and language …
Self-supervised vision-language pretraining for medial visual question answering
P Li, G Liu, L Tan, J Liao… - 2023 IEEE 20th …, 2023 - ieeexplore.ieee.org
Medical image visual question answering (VQA) is a task to answer clinical questions, given
a radiographic image, which is a challenging problem that requires a model to integrate both …
a radiographic image, which is a challenging problem that requires a model to integrate both …
Overview of the ImageCLEF 2022: Multimedia retrieval in medical, social media and nature applications
This paper presents an overview of the ImageCLEF 2022 lab that was organized as part of
the Conference and Labs of the Evaluation Forum–CLEF Labs 2022. ImageCLEF is an …
the Conference and Labs of the Evaluation Forum–CLEF Labs 2022. ImageCLEF is an …
Overview of ImageCLEFmedical 2023–caption prediction and concept detection
J Rückert, A Ben Abacha… - Working Notes of the …, 2023 - arodes.hes-so.ch
Résumé The 2023 ImageCLEFmedical GANs task is the first edition of this task, examining
the existing hypothesis that GANs (Generative Adversarial Networks) are generating …
the existing hypothesis that GANs (Generative Adversarial Networks) are generating …
[PDF][PDF] Aueb nlp group at imageclefmedical caption 2022
F Charalampakos, G Zachariadis… - … Working Notes, CEUR …, 2022 - ceur-ws.org
We present the methods AUEB's NLP Group used to participate in the annual
ImageCLEFmedical Caption Task. The task comprises of the Concept Detection and the …
ImageCLEFmedical Caption Task. The task comprises of the Concept Detection and the …
ImageCLEF 2023 highlight: multimedia retrieval in medical, social media and content recommendation applications
In this paper, we provide an overview of the upcoming ImageCLEF campaign. ImageCLEF is
part of the CLEF Conference and Labs of the Evaluation Forum since 2003. ImageCLEF, the …
part of the CLEF Conference and Labs of the Evaluation Forum since 2003. ImageCLEF, the …
SciOL and MuLMS-Img: Introducing A Large-Scale Multimodal Scientific Dataset and Models for Image-Text Tasks in the Scientific Domain
In scientific publications, a substantial part of the information is expressed via figures
containing images and diagrams. Hence, the retrieval of relevant figures given a natural …
containing images and diagrams. Hence, the retrieval of relevant figures given a natural …