AMAM: an attention-based multimodal alignment model for medical visual question answering

H Pan, S He, K Zhang, B Qu, C Chen, K Shi - Knowledge-Based Systems, 2022 - Elsevier
Abstract Medical Visual Question Answering (VQA) is a multimodal task to answer clinical
questions about medical images. Existing methods have achieved good performance, but …

Optimal deep neural network-based model for answering visual medical question

K Gasmi, IB Ltaifa, G Lejeune… - Cybernetics and …, 2022 - Taylor & Francis
Over the last few years, the amount of available information has increased exponentially in
all professional fields, including the medical field. Modern-day patients have access to a …

Relation-enhanced detr for component detection in graphic design reverse engineering

X Hao, D Huang, J Lin, CY Lin - … of the Thirty-Second International Joint …, 2023 - dl.acm.org
It is a common practice for designers to create digital prototypes from a mock-up/screenshot.
Reverse engineering graphic design by detecting its components (eg, text, icon, button) …

Overview of ImageCLEFtuberculosis 2021: CT-based tuberculosis type classification

S Kozlovski, V Liauchuk, Y Dicente Cid… - Proceedings of the …, 2021 - arodes.hes-so.ch
Résumé ImageCLEF is a part of the Conference and Labs of the Evaluation Forum (CLEF)
initiative and includes a variety of tasks dedicated to multimodal image information retrieval …

From Data to Diagnosis: Enhancing Radiology Reporting with Clinical Features Encoding and Cross-Modal Coherence

S Iqbal, AN Qureshi, F Khan, K Aurangzeb… - IEEE …, 2024 - ieeexplore.ieee.org
The integration of radiology reports for healthcare treatment using AI presents a
transformative opportunity to enhance patient care and optimize healthcare delivery …

ACapMed: Automatic Captioning for Medical Imaging

DR Beddiar, M Oussalah, T Seppänen, R Jennane - Applied Sciences, 2022 - mdpi.com
Medical image captioning is a very challenging task that has been rarely addressed in the
literature on natural image captioning. Some existing image captioning techniques exploit …

Overview of ImageCLEFtuberculosis 2022: CT-based cavern detection and report

S Kozlovski, Y Dicente Cid, V Kovalev… - … 2022: Conference and …, 2022 - arodes.hes-so.ch
Résumé ImageCLEF is a part of the Conference and Labs of the Evaluation Forum (CLEF)
initiative and includes avariety of tasks dedicated to multimodal image information retrieval …

[PDF][PDF] SSN MLRG at VQA-MED 2021: An Approach for VQA to Solve Abnormality Related Queries using Improved Datasets.

NMS Sitara, K Srinivasan - CLEF (working notes), 2021 - researchgate.net
Abstract The Visual Question Answering (VQA) in the medical domain attains tremendous
advancement in last few years. To improvise the VQA research, ImageCLEF forum is …

What Happened in CLEF For Another While?

N Ferro - International Conference of the Cross-Language …, 2024 - Springer
Abstract 2024 marks the 25 th birthday for CLEF, an evaluation campaign activity which has
applied the Cranfield evaluation paradigm to the testing of multilingual and multimodal …

[PDF][PDF] Shengyan at VQA-Med 2020: An Encoder-Decoder Model for Medical Domain Visual Question Answering Task.

S Liu, H Ding, X Zhou - CLEF (working notes), 2020 - star.informatik.rwth-aachen.de
Intelligent learning and understanding of image and text information are important research
directions for the successful application of deep learning in computer vision (CV) and natural …