Semi-supervised semantic segmentation with pixel-level contrastive learning from a class-wise memory bank

I Alonso, A Sabater, D Ferstl… - Proceedings of the …, 2021 - openaccess.thecvf.com
This work presents a novel approach for semi-supervised semantic segmentation. The key
element of this approach is our contrastive learning module that enforces the segmentation …

Transmix: Attend to mix for vision transformers

JN Chen, S Sun, J He, PHS Torr… - Proceedings of the …, 2022 - openaccess.thecvf.com
Mixup-based augmentation has been found to be effective for generalizing models during
training, especially for Vision Transformers (ViTs) since they can easily overfit. However …

[HTML][HTML] PatchMask: A data augmentation strategy with Gaussian noise in hyperspectral images

HX Dou, XS Lu, C Wang, HZ Shen, YW Zhuo, LJ Deng - Remote Sensing, 2022 - mdpi.com
Data augmentation (DA) is an effective way to enrich the richness of data and improve a
model's generalization ability. It has been widely used in many advanced vision tasks (eg …

Joint learning of RGBW color filter arrays and demosaicking

C Bai, F Liu, J Li - Pattern Recognition, 2025 - Elsevier
RGBW color filter arrays (CFAs) have gained widespread attention for their superior
performance in low-light conditions. Most existing demosaicking methods are tailored for …

Samplingaug: On the importance of patch sampling augmentation for single image super-resolution

S Wang, M Lu, K Chen, J Liu, X Li, M Wu - arxiv preprint arxiv …, 2021 - arxiv.org
With the development of Deep Neural Networks (DNNs), plenty of methods based on DNNs
have been proposed for Single Image Super-Resolution (SISR). However, existing methods …

Isp meets deep learning: A survey on deep learning methods for image signal processing

MHM da Silva, JVS da Silva, RR Arrais… - arxiv preprint arxiv …, 2023 - arxiv.org
The entire Image Signal Processor (ISP) of a camera relies on several processes to
transform the data from the Color Filter Array (CFA) sensor, such as demosaicing, denoising …

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery

R Zhang, L Du, J Liu, C Song, F Wang, X Li… - Proceedings of the 33rd …, 2023 - dl.acm.org
Recently, content-aware methods have been utilized to reduce the bandwidth and improve
the quality of Internet video delivery. Existing methods train corresponding content-aware …

[PDF][PDF] Overcoming Degradation Imbalance for Consistent Image Dehazing.

P Shyam, H Yoo - BMVC, 2023 - papers.bmvc2023.org
MSE or MAE loss functions work under the premise of all pixels having equal contribution
during optimization. However, natural haze degradations are non-homogeneous, resulting …

Towards unified visual perception

S Sun - 2024 - ora.ox.ac.uk
This thesis explores the frontier of visual perception in computer vision by leveraging the
capabilities of Vision Transformers (ViTs) to create a unified framework that addresses cross …

ISP Meets Deep Learning: A Survey on Deep Learning Methods for Image Signal Processing

CFG Santos, RR Arrais, JVS Silva, MHM Silva… - ACM Computing …, 2021 - dl.acm.org
The entire Image Signal Processor (ISP) of a camera relies on several processes to
transform the data from the Color Filter Array (CFA) sensor, such as demosaicing, denoising …