Visual semantic segmentation based on few/zero-shot learning: An overview

W Ren, Y Tang, Q Sun, C Zhao… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Visual semantic segmentation aims at separating a visual sample into diverse blocks with
specific semantic attributes and identifying the category for each block, and it plays a crucial …

Srformer: Permuted self-attention for single image super-resolution

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …

Vscode: General visual salient and camouflaged object detection with 2d prompt learning

Z Luo, N Liu, W Zhao, X Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Salient object detection (SOD) and camouflaged object detection (COD) are related yet
distinct binary map** tasks. These tasks involve multiple modalities sharing …

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

Autosam: Adapting sam to medical images by overloading the prompt encoder

T Shaharabany, A Dahan, R Giryes, L Wolf - arxiv preprint arxiv …, 2023 - arxiv.org
The recently introduced Segment Anything Model (SAM) combines a clever architecture and
large quantities of training data to obtain remarkable image segmentation capabilities …

Video transformers: A survey

J Selva, AS Johansen, S Escalera… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer models have shown great success handling long-range interactions, making
them a promising tool for modeling video. However, they lack inductive biases and scale …

Full-duplex strategy for video object segmentation

GP Ji, K Fu, Z Wu, DP Fan, J Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …

Video polyp segmentation: A deep learning perspective

GP Ji, G **ao, YC Chou, DP Fan, K Zhao… - Machine Intelligence …, 2022 - Springer
We present the first comprehensive video polyp segmentation (VPS) study in the deep
learning era. Over the years, developments in VPS are not moving forward with ease due to …

Siamese network for RGB-D salient object detection and beyond

K Fu, DP Fan, GP Ji, Q Zhao, J Shen… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as
independent information and design separate networks for feature extraction from each …

Camoformer: Masked separable attention for camouflaged object detection

B Yin, X Zhang, DP Fan, S Jiao… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
How to identify and segment camouflaged objects from the background is challenging.
Inspired by the multi-head self-attention in Transformers, we present a simple masked …