Learning multi-axis representation in frequency domain for medical image segmentation

J Ruan, J Gao, M **e, S **ang - Machine Learning, 2025 - Springer
Abstract Recently, Visual Transformer (ViT) has been extensively used in medical image
segmentation (MIS) due to applying self-attention mechanism in the spatial domain to …

Towards discriminative representation with meta-learning for colonoscopic polyp re-identification

S **ang, Q Chen, S Cai, C Zhou, C Cai, S Du… - arxiv preprint arxiv …, 2023 - arxiv.org
Colonoscopic Polyp Re-Identification aims to match the same polyp from a large gallery with
images from different views taken using different cameras and plays an important role in the …

CLAPP: Contrastive Language-Audio Pre-training in Passive Underwater Vessel Classification

Z Li, J Gao, T Yu, S **ang, J Ruan, T Liu… - arxiv preprint arxiv …, 2024 - arxiv.org
Existing research on audio classification faces challenges in recognizing attributes of
passive underwater vessel scenarios and lacks well-annotated datasets due to data privacy …

VT-ReID: Learning Discriminative Visual-Text Representation for Polyp Re-Identification

S **ang, C Liu, J Ruan, S Cai, S Du… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Colonoscopic Polyp Re-Identification (ReID) aims to match a specific polyp in a large gallery
with different cameras and views, which plays a key role in the prevention and treatment of …

Self-Supervised Visual Representation Learning for Medical Image Analysis: A Comprehensive Survey

S Manna, S Bhattacharya, U Pal - Transactions on Machine Learning … - openreview.net
Deep learning has developed as a great tool for many computer vision or natural language
processing tasks. However, supervised deep learning algorithms require a large amount of …