A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

A survey of machine learning and deep learning in remote sensing of geological environment: Challenges, advances, and opportunities

W Han, X Zhang, Y Wang, L Wang, X Huang… - ISPRS Journal of …, 2023 - Elsevier
Due to limited resources and environmental pollution, monitoring the geological
environment has become essential for many countries' sustainable development. As various …

Groupvit: Semantic segmentation emerges from text supervision

J Xu, S De Mello, S Liu, W Byeon… - Proceedings of the …, 2022 - openaccess.thecvf.com
Grou** and recognition are important components of visual scene understanding, eg, for
object detection and semantic segmentation. With end-to-end deep learning systems …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Scaling open-vocabulary image segmentation with image-level labels

G Ghiasi, X Gu, Y Cui, TY Lin - European Conference on Computer Vision, 2022 - Springer
We design an open-vocabulary image segmentation model to organize an image into
meaningful regions indicated by arbitrary texts. Recent works (CLIP and ALIGN), despite …

Token contrast for weakly-supervised semantic segmentation

L Ru, H Zheng, Y Zhan, B Du - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Abstract Weakly-Supervised Semantic Segmentation (WSSS) using image-level labels
typically utilizes Class Activation Map (CAM) to generate the pseudo labels. Limited by the …

Satmae: Pre-training transformers for temporal and multi-spectral satellite imagery

Y Cong, S Khanna, C Meng, P Liu… - Advances in …, 2022 - proceedings.neurips.cc
Unsupervised pre-training methods for large vision models have shown to enhance
performance on downstream supervised tasks. Develo** similar techniques for satellite …

Multi-class token transformer for weakly supervised semantic segmentation

L Xu, W Ouyang, M Bennamoun… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper proposes a new transformer-based framework to learn class-specific object
localization maps as pseudo labels for weakly supervised semantic segmentation (WSSS) …

Learning affinity from attention: End-to-end weakly-supervised semantic segmentation with transformers

L Ru, Y Zhan, B Yu, B Du - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Weakly-supervised semantic segmentation (WSSS) with image-level labels is an important
and challenging task. Due to the high training efficiency, end-to-end solutions for WSSS …

A survey on vision transformer

K Han, Y Wang, H Chen, X Chen, J Guo… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Transformer, first applied to the field of natural language processing, is a type of deep neural
network mainly based on the self-attention mechanism. Thanks to its strong representation …