Advances in medical image analysis with vision transformers: a comprehensive review

R Azad, A Kazerouni, M Heidari, EK Aghdam… - Medical Image …, 2024 - Elsevier
The remarkable performance of the Transformer architecture in natural language processing
has recently also triggered broad interest in Computer Vision. Among other merits …

Deep learning for medical anomaly detection–a survey

T Fernando, H Gammulle, S Denman… - ACM Computing …, 2021 - dl.acm.org
Machine learning–based medical anomaly detection is an important problem that has been
extensively studied. Numerous approaches have been proposed across various medical …

Pranet: Parallel reverse attention network for polyp segmentation

DP Fan, GP Ji, T Zhou, G Chen, H Fu, J Shen… - … conference on medical …, 2020 - Springer
Colonoscopy is an effective technique for detecting colorectal polyps, which are highly
related to colorectal cancer. In clinical practice, segmenting polyps from colonoscopy …

Omnimedvqa: A new large-scale comprehensive evaluation benchmark for medical lvlm

Y Hu, T Li, Q Lu, W Shao, J He… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Large Vision-Language Models (LVLMs) have demonstrated remarkable
capabilities in various multimodal tasks. However their potential in the medical domain …

Resunet++: An advanced architecture for medical image segmentation

D Jha, PH Smedsrud, MA Riegler… - … on multimedia (ISM), 2019 - ieeexplore.ieee.org
Accurate computer-aided polyp detection and segmentation during colonoscopy
examinations can help endoscopists resect abnormal tissue and thereby decrease chances …

Kvasir-seg: A segmented polyp dataset

D Jha, PH Smedsrud, MA Riegler, P Halvorsen… - … conference, MMM 2020 …, 2020 - Springer
Pixel-wise image segmentation is a highly demanding task in medical-image analysis. In
practice, it is difficult to find annotated medical images with corresponding segmentation …

Emr-merging: Tuning-free high-performance model merging

C Huang, P Ye, T Chen, T He, X Yue… - Advances in Neural …, 2025 - proceedings.neurips.cc
The success of pretrain-finetune paradigm brings about the release of numerous model
weights. In this case, merging models finetuned on different tasks to enable a single model …

HiFuse: Hierarchical multi-scale feature fusion network for medical image classification

X Huo, G Sun, S Tian, Y Wang, L Yu, J Long… - … Signal Processing and …, 2024 - Elsevier
Effective fusion of global and local multi-scale features is crucial for medical image
classification. Medical images have many noisy, scattered features, intra-class variations …

Real-time polyp detection, localization and segmentation in colonoscopy using deep learning

D Jha, S Ali, NK Tomar, HD Johansen… - Ieee …, 2021 - ieeexplore.ieee.org
Computer-aided detection, localization, and segmentation methods can help improve
colonoscopy procedures. Even though many methods have been built to tackle automatic …

Medmamba: Vision mamba for medical image classification

Y Yue, Z Li - arxiv preprint arxiv:2403.03849, 2024 - arxiv.org
Since the era of deep learning, convolutional neural networks (CNNs) and vision
transformers (ViTs) have been extensively studied and widely used in medical image …