AMAM: an attention-based multimodal alignment model for medical visual question answering
H Pan, S He, K Zhang, B Qu, C Chen, K Shi - Knowledge-Based Systems, 2022 - Elsevier
Abstract Medical Visual Question Answering (VQA) is a multimodal task to answer clinical
questions about medical images. Existing methods have achieved good performance, but …
Optimal deep neural network-based model for answering visual medical question
Over the last few years, the amount of available information has increased exponentially in
all professional fields, including the medical field. Modern-day patients have access to a …
Relation-enhanced detr for component detection in graphic design reverse engineering
It is a common practice for designers to create digital prototypes from a mock-up/screenshot.
Reverse engineering graphic design by detecting its components (e.g., text, icon, button) …
Overview of ImageCLEFtuberculosis 2021: CT-based tuberculosis type classification
Abstract ImageCLEF is a part of the Conference and Labs of the Evaluation Forum (CLEF)
initiative and includes a variety of tasks dedicated to multimodal image information retrieval …
From Data to Diagnosis: Enhancing Radiology Reporting with Clinical Features Encoding and Cross-Modal Coherence
The integration of radiology reports for healthcare treatment using AI presents a
transformative opportunity to enhance patient care and optimize healthcare delivery …
ACapMed: Automatic Captioning for Medical Imaging
Medical image captioning is a very challenging task that has been rarely addressed in the
literature on natural image captioning. Some existing image captioning techniques exploit …
Overview of ImageCLEFtuberculosis 2022: CT-based cavern detection and report
Abstract ImageCLEF is a part of the Conference and Labs of the Evaluation Forum (CLEF)
initiative and includes a variety of tasks dedicated to multimodal image information retrieval …
SSN MLRG at VQA-MED 2021: An Approach for VQA to Solve Abnormality Related Queries using Improved Datasets.
NMS Sitara, K Srinivasan - CLEF (working notes), 2021 - researchgate.net
Abstract The Visual Question Answering (VQA) in the medical domain attains tremendous
advancement in last few years. To improvise the VQA research, ImageCLEF forum is …
What Happened in CLEF For Another While?
N Ferro - International Conference of the Cross-Language …, 2024 - Springer
Abstract 2024 marks the 25th birthday for CLEF, an evaluation campaign activity which has
applied the Cranfield evaluation paradigm to the testing of multilingual and multimodal …
Shengyan at VQA-Med 2020: An Encoder-Decoder Model for Medical Domain Visual Question Answering Task.
S Liu, H Ding, X Zhou - CLEF (working notes), 2020 - star.informatik.rwth-aachen.de
Intelligent learning and understanding of image and text information are important research
directions for the successful application of deep learning in computer vision (CV) and natural …