Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation
In this paper we first assess and harness various Vision Foundation Models (VFMs) in the
context of Domain Generalized Semantic Segmentation (DGSS). Driven by the motivation …
context of Domain Generalized Semantic Segmentation (DGSS). Driven by the motivation …
Frequency-spatial entanglement learning for camouflaged object detection
Camouflaged object detection has attracted a lot of attention in computer vision. The main
challenge lies in the high degree of similarity between camouflaged objects and their …
challenge lies in the high degree of similarity between camouflaged objects and their …
Dginstyle: Domain-generalizable semantic segmentation with image diffusion models and stylized semantic control
Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability
to generate creative content, specialize to user data through few-shot fine-tuning, and …
to generate creative content, specialize to user data through few-shot fine-tuning, and …
Learning content-enhanced mask transformer for domain generalized urban-scene segmentation
Domain-generalized urban-scene semantic segmentation (USSS) aims to learn generalized
semantic predictions across diverse urban-scene styles. Unlike generic domain gap …
semantic predictions across diverse urban-scene styles. Unlike generic domain gap …
Calibration-based multi-prototype contrastive learning for domain generalization semantic segmentation in traffic scenes
Prototypical contrastive learning (PCL) has been widely used to learn class-wise domain-
invariant features for domain generalization semantic segmentation. These methods …
invariant features for domain generalization semantic segmentation. These methods …
Mgmap: Mask-guided learning for online vectorized hd map construction
Currently high-definition (HD) map construction leans towards a lightweight online
generation tendency which aims to preserve timely and reliable road scene information …
generation tendency which aims to preserve timely and reliable road scene information …
VLTSeg: Simple transfer of CLIP-based vision-language representations for domain generalized semantic segmentation
Domain generalization (DG) remains a significant challenge for perception based on deep
neural networks (DNN), where domain shifts occur due to lighting, weather, or geolocation …
neural networks (DNN), where domain shifts occur due to lighting, weather, or geolocation …
Learning spectral-decomposited tokens for domain generalized semantic segmentation
The rapid development of Vision Foundation Model (VFM) brings inherent out-domain
generalization for a variety of down-stream tasks. Among them, domain generalized …
generalization for a variety of down-stream tasks. Among them, domain generalized …
GPT4Ego: unleashing the potential of pre-trained models for zero-shot egocentric action recognition
Vision-Language Models (VLMs), pre-trained on large-scale datasets, have shown
impressive performance in various visual recognition tasks. This advancement paves the …
impressive performance in various visual recognition tasks. This advancement paves the …
Learning generalized segmentation for foggy-scenes by bi-directional wavelet guidance
Learning scene semantics that can be well generalized to foggy conditions is important for
safety-crucial applications such as autonomous driving. Existing methods need both …
safety-crucial applications such as autonomous driving. Existing methods need both …