Google Académico

360sfuda++: Towards source-free uda for panoramic segmentation by learning reliable category prototypes

X Zheng, PY Zhou, AV Vasilakos… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In this paper, we address the challenging source-free unsupervised domain adaptation
(SFUDA) for pinhole-to-panoramic semantic segmentation, given only a pinhole image pre …

Guardar Citar Citado por 3 Artículos relacionados Las 2 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Image segmentation in foundation model era: A survey

T Zhou, F Zhang, B Chang, W Wang, Y Yuan… - arxiv preprint arxiv …, 2024 - arxiv.org

Image segmentation is a long-standing challenge in computer vision, studied continuously
over several decades, as evidenced by seminal algorithms such as N-Cut, FCN, and …

Guardar Citar Citado por 5 Artículos relacionados Las 3 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

[PDF][PDF] Segment anything without supervision

X Wang, J Yang, T Darrell - The Thirty-eighth Annual …, 2024 - proceedings.neurips.cc

Abstract The Segmentation Anything Model (SAM) requires labor-intensive data labeling.
We present Unsupervised SAM (UnSAM) for promptable and automatic wholeimage …

Guardar Citar Citado por 3 Artículos relacionados Las 4 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Open-vocabulary segmentation with unpaired mask-text supervision

Z Wang, X **a, Z Chen, X He, Y Guo, M Gong… - arxiv preprint arxiv …, 2024 - arxiv.org

Contemporary cutting-edge open-vocabulary segmentation approaches commonly rely on
image-mask-text triplets, yet this restricted annotation is labour-intensive and encounters …

Guardar Citar Citado por 9 Artículos relacionados Las 2 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Towards Semantic Equivalence of Tokenization in Multimodal LLM

S Wu, H Fei, X Li, J Ji, H Zhang, TS Chua… - arxiv preprint arxiv …, 2024 - arxiv.org

Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in
processing vision-language tasks. One of the crux of MLLMs lies in vision tokenization …

Guardar Citar Citado por 41 Artículos relacionados Las 2 versiones Versión en HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Generalization Boosted Adapter for Open-Vocabulary Segmentation

W Xu, C Wang, X Feng, R Xu, L Huang… - … on Circuits and …, 2024 - ieeexplore.ieee.org

Vision-language models (VLMs) have demonstrated remarkable open-vocabulary object
recognition capabilities, motivating their adaptation for dense prediction tasks like …

Guardar Citar Citado por 1 Artículos relacionados Las 4 versiones

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Compact representation for memory-efficient storage of images using genetic algorithm-guided key pixel selection

S Malakar, N Banerjee, DK Prasad - Engineering Applications of Artificial …, 2025 - Elsevier

In the past few years, we have observed rapid growth in digital content. Even in the
biological domain, the arrival of microscopic and nanoscopic images and videos captured …

Guardar Citar Artículos relacionados Las 3 versiones

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving

D Bogdoll, N Ollick, T Joseph, S Pavlitska… - arxiv preprint arxiv …, 2024 - arxiv.org

Dealing with atypical traffic scenarios remains a challenging task in autonomous driving.
However, most anomaly detection approaches cannot be trained on raw sensor data but …

Guardar Citar Citado por 1 Artículos relacionados Las 2 versiones Versión en HTML

Onet: Twin U-Net Architecture for Unsupervised Binary Semantic Segmentation in Radar and Remote Sensing Images

Y Zhou, H Su, T Wang, Q Hu - IEEE Transactions on Image …, 2025 - ieeexplore.ieee.org

Segmenting objects from cluttered backgrounds in single-channel images, such as marine
radar echoes, medical images, and remote sensing images, poses significant challenges …

Guardar Citar Artículos relacionados

[Free GPT-4]
[DeepSeek]

[PDF] frontiersin.org

A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation

C Zhang, L Wang, G Wei, Z Kong, M Qiu - Frontiers in Physiology, 2024 - frontiersin.org

Introduction Ultrasound imaging has become a crucial tool in medical diagnostics, offering
real-time visualization of internal organs and tissues. However, challenges such as low …

Guardar Citar Artículos relacionados Las 3 versiones En caché

Crear alerta

Citar

Búsqueda avanzada

Guardado en Mi biblioteca

Unsupervised universal image segmentation

360sfuda++: Towards source-free uda for panoramic segmentation by learning reliable category prototypes

Image segmentation in foundation model era: A survey

[PDF][PDF] Segment anything without supervision

Open-vocabulary segmentation with unpaired mask-text supervision

Towards Semantic Equivalence of Tokenization in Multimodal LLM

Generalization Boosted Adapter for Open-Vocabulary Segmentation

[HTML][HTML] Compact representation for memory-efficient storage of images using genetic algorithm-guided key pixel selection

UMAD: Unsupervised Mask-Level Anomaly Detection for Autonomous Driving

Onet: Twin U-Net Architecture for Unsupervised Binary Semantic Segmentation in Radar and Remote Sensing Images

A dual-branch and dual attention transformer and CNN hybrid network for ultrasound image segmentation