Consequential advancements of self-supervised learning (SSL) in deep learning contexts

MM Abdulrazzaq, NTA Ramaha, AA Hameed… - Mathematics, 2024 - mdpi.com
Self-supervised learning (SSL) is a potential deep learning (DL) technique that uses
massive volumes of unlabeled data to train neural networks. SSL techniques have evolved …

Self-supervised learning of object parts for semantic segmentation

A Ziegler, YM Asano - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Progress in self-supervised learning has brought strong general image representation
learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks …

Geometric visual similarity learning in 3d medical image self-supervised pre-training

Y He, G Yang, R Ge, Y Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning inter-image similarity is crucial for 3D medical images self-supervised pre-training,
due to their sharing of numerous same semantic regions. However, the lack of the semantic …

Mutual contrastive learning for visual representation learning

C Yang, Z An, L Cai, Y Xu - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org
We present a collaborative learning method called Mutual Contrastive Learning (MCL) for
general visual representation learning. The core idea of MCL is to perform mutual interaction …

Multi-view correlation distillation for incremental object detection

D Yang, Y Zhou, A Zhang, X Sun, D Wu, W Wang… - Pattern Recognition, 2022 - Elsevier
In real applications, new object classes often emerge after the detection model has been
trained on a prepared dataset with fixed classes. Fine-tuning the old model with only new …

Transfgu: a top-down approach to fine-grained unsupervised semantic segmentation

Z Yin, P Wang, F Wang, X Xu, H Zhang, H Li… - European conference on …, 2022 - Springer
Unsupervised semantic segmentation aims to obtain high-level semantic representation on
low-level visual features without manual annotations. Most existing methods are bottom-up …

Contrastive conditional latent diffusion for audio-visual segmentation

Y Mao, J Zhang, M **ang, Y Lv, Y Zhong… - arxiv preprint arxiv …, 2023 - arxiv.org
We propose a latent diffusion model with contrastive learning for audio-visual segmentation
(AVS) to extensively explore the contribution of audio. We interpret AVS as a conditional …

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Y Shu, W Zeng, Z Li, F Zhao, Y Zhou - arxiv preprint arxiv:2402.03082, 2024 - arxiv.org
Visual text, a pivotal element in both document and scene images, speaks volumes and
attracts significant attention in the computer vision domain. Beyond visual text detection and …

Weakly-supervised contrastive learning for unsupervised object discovery

Y Lv, J Zhang, N Barnes, Y Dai - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Unsupervised object discovery (UOD) refers to the task of discriminating the whole region of
objects from the background within a scene without relying on labeled datasets, which …

Perceiving ambiguity and semantics without recognition: an efficient and effective ambiguous scene text detector

Y Shu, W Wang, Y Zhou, S Liu, A Zhang… - Proceedings of the 31st …, 2023 - dl.acm.org
Ambiguous scene text detection is an extremely challenging task. Existing text detectors that
rely solely on visual cues often suffer from confusion due to being evenly distributed in …