Sam2-adapter: Evaluating & adapting segment anything 2 in downstream tasks: Camouflage, shadow, medical image segmentation, and more
The advent of large models, also known as foundation models, has significantly transformed
the AI research landscape, with models like Segment Anything (SAM) achieving notable …
the AI research landscape, with models like Segment Anything (SAM) achieving notable …
Pptformer: Pseudo multi-perspective transformer for uav segmentation
The ascension of Unmanned Aerial Vehicles (UAVs) in various fields necessitates effective
UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV …
UAV image segmentation, which faces challenges due to the dynamic perspectives of UAV …
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
Open-vocabulary 3D segmentation enables exploration of 3D spaces using free-form text
descriptions. Existing methods for open-vocabulary 3D instance segmentation primarily …
descriptions. Existing methods for open-vocabulary 3D instance segmentation primarily …
Structural and Statistical Texture Knowledge Distillation and Learning for Segmentation
We propose to re-emphasize the low-level texture information in deep networks for semantic
segmentation and related knowledge distillation tasks. Low-level texture feature/knowledge …
segmentation and related knowledge distillation tasks. Low-level texture feature/knowledge …
Not Every Patch is Needed: Towards a More Efficient and Effective Backbone for Video-based Person Re-identification
This paper proposes a new effective and efficient plug-and-play backbone for video-based
person re-identification (ReID). Conventional video-based ReID methods typically use CNN …
person re-identification (ReID). Conventional video-based ReID methods typically use CNN …
Multimodal 3D Reasoning Segmentation with Complex Scenes
The recent development in multimodal learning has greatly advanced the research in 3D
scene understanding in various real-world tasks such as embodied AI. However, most …
scene understanding in various real-world tasks such as embodied AI. However, most …
Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches
Y Zang, R Cao, J Zhang, Y Han, Z Cao, W Hu… - arxiv preprint arxiv …, 2025 - arxiv.org
Sketches, with their expressive potential, allow humans to convey the essence of an object
through even a rough contour. For the first time, we harness this expressive potential to …
through even a rough contour. For the first time, we harness this expressive potential to …
Discrete Latent Perspective Learning for Segmentation and Detection
In this paper, we address the challenge of Perspective-Invariant Learning in machine
learning and computer vision, which involves enabling a network to understand images from …
learning and computer vision, which involves enabling a network to understand images from …