360sfuda++: Towards source-free uda for panoramic segmentation by learning reliable category prototypes

X Zheng, PY Zhou, AV Vasilakos… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this paper, we address the challenging source-free unsupervised domain adaptation
(SFUDA) for pinhole-to-panoramic semantic segmentation, given only a pinhole image pre …

Image segmentation in foundation model era: A survey

T Zhou, F Zhang, B Chang, W Wang, Y Yuan… - arxiv preprint arxiv …, 2024 - arxiv.org
Image segmentation is a long-standing challenge in computer vision, studied continuously
over several decades, as evidenced by seminal algorithms such as N-Cut, FCN, and …

[PDF][PDF] Segment anything without supervision

X Wang, J Yang, T Darrell - The Thirty-eighth Annual …, 2024 - proceedings.neurips.cc
Abstract The Segmentation Anything Model (SAM) requires labor-intensive data labeling.
We present Unsupervised SAM (UnSAM) for promptable and automatic wholeimage …

Open-vocabulary segmentation with unpaired mask-text supervision

Z Wang, X **a, Z Chen, X He, Y Guo, M Gong… - arxiv preprint arxiv …, 2024 - arxiv.org
Contemporary cutting-edge open-vocabulary segmentation approaches commonly rely on
image-mask-text triplets, yet this restricted annotation is labour-intensive and encounters …

Towards Semantic Equivalence of Tokenization in Multimodal LLM

S Wu, H Fei, X Li, J Ji, H Zhang, TS Chua… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in
processing vision-language tasks. One of the crux of MLLMs lies in vision tokenization …

Generalization Boosted Adapter for Open-Vocabulary Segmentation

W Xu, C Wang, X Feng, R Xu, L Huang… - … on Circuits and …, 2024 - ieeexplore.ieee.org
Vision-language models (VLMs) have demonstrated remarkable open-vocabulary object
recognition capabilities, motivating their adaptation for dense prediction tasks like …

[HTML][HTML] Compact representation for memory-efficient storage of images using genetic algorithm-guided key pixel selection

S Malakar, N Banerjee, DK Prasad - Engineering Applications of Artificial …, 2025 - Elsevier
In the past few years, we have observed rapid growth in digital content. Even in the
biological domain, the arrival of microscopic and nanoscopic images and videos captured …

UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving

D Bogdoll, N Ollick, T Joseph, S Pavlitska… - arxiv preprint arxiv …, 2024 - arxiv.org
Dealing with atypical traffic scenarios remains a challenging task in autonomous driving.
However, most anomaly detection approaches cannot be trained on raw sensor data but …

Onet: Twin U-Net Architecture for Unsupervised Binary Semantic Segmentation in Radar and Remote Sensing Images

Y Zhou, H Su, T Wang, Q Hu - IEEE Transactions on Image …, 2025 - ieeexplore.ieee.org
Segmenting objects from cluttered backgrounds in single-channel images, such as marine
radar echoes, medical images, and remote sensing images, poses significant challenges …

A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation

C Zhang, L Wang, G Wei, Z Kong, M Qiu - Frontiers in Physiology, 2024 - frontiersin.org
Introduction Ultrasound imaging has become a crucial tool in medical diagnostics, offering
real-time visualization of internal organs and tissues. However, challenges such as low …