Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation

Z Wei, L Chen, Y **, X Ma, T Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
In this paper we first assess and harness various Vision Foundation Models (VFMs) in the
context of Domain Generalized Semantic Segmentation (DGSS). Driven by the motivation …

Frequency-spatial entanglement learning for camouflaged object detection

Y Sun, C Xu, J Yang, H Xuan, L Luo - European Conference on Computer …, 2024 - Springer
Camouflaged object detection has attracted a lot of attention in computer vision. The main
challenge lies in the high degree of similarity between camouflaged objects and their …

Dginstyle: Domain-generalizable semantic segmentation with image diffusion models and stylized semantic control

Y Jia, L Hoyer, S Huang, T Wang, L Van Gool… - … on Computer Vision, 2024 - Springer
Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability
to generate creative content, specialize to user data through few-shot fine-tuning, and …

Learning content-enhanced mask transformer for domain generalized urban-scene segmentation

Q Bi, S You, T Gevers - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Domain-generalized urban-scene semantic segmentation (USSS) aims to learn generalized
semantic predictions across diverse urban-scene styles. Unlike generic domain gap …

Calibration-based multi-prototype contrastive learning for domain generalization semantic segmentation in traffic scenes

M Liao, S Tian, Y Zhang, G Hua… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Prototypical contrastive learning (PCL) has been widely used to learn class-wise domain-
invariant features for domain generalization semantic segmentation. These methods …

Mgmap: Mask-guided learning for online vectorized hd map construction

X Liu, S Wang, W Li, R Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Currently high-definition (HD) map construction leans towards a lightweight online
generation tendency which aims to preserve timely and reliable road scene information …

VLTSeg: Simple transfer of CLIP-based vision-language representations for domain generalized semantic segmentation

C Hümmer, M Schwonberg, L Zhou, H Cao… - arxiv preprint arxiv …, 2023 - arxiv.org
Domain generalization (DG) remains a significant challenge for perception based on deep
neural networks (DNN), where domain shifts occur due to lighting, weather, or geolocation …

Learning spectral-decomposited tokens for domain generalized semantic segmentation

J Yi, Q Bi, H Zheng, H Zhan, W Ji, Y Huang… - Proceedings of the …, 2024 - dl.acm.org
The rapid development of Vision Foundation Model (VFM) brings inherent out-domain
generalization for a variety of down-stream tasks. Among them, domain generalized …

GPT4Ego: unleashing the potential of pre-trained models for zero-shot egocentric action recognition

G Dai, X Shu, W Wu, R Yan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown
impressive performance in various visual recognition tasks. This advancement paves the …

Learning generalized segmentation for foggy-scenes by bi-directional wavelet guidance

Q Bi, S You, T Gevers - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Learning scene semantics that can be well generalized to foggy conditions is important for
safety-crucial applications such as autonomous driving. Existing methods need both …