A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

Self-supervised representation learning: Introduction, advances, and challenges

L Ericsson, H Gouk, CC Loy… - IEEE Signal Processing …, 2022 - ieeexplore.ieee.org
Self-supervised representation learning (SSRL) methods aim to provide powerful, deep
feature learning without the requirement of large annotated data sets, thus alleviating the …

Self-supervised predictive convolutional attentive block for anomaly detection

NC Ristea, N Madan, RT Ionescu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Anomaly detection is commonly pursued as a one-class classification problem, where
models can only learn from normal training samples, while being evaluated on both normal …

Self-supervised co-training for video representation learning

T Han, W **e, A Zisserman - Advances in neural information …, 2020 - proceedings.neurips.cc
The objective of this paper is visual-only self-supervised video representation learning. We
make the following contributions:(i) we investigate the benefit of adding semantic-class …

Videomoco: Contrastive video representation learning with temporally adversarial examples

T Pan, Y Song, T Yang, W Jiang… - Proceedings of the …, 2021 - openaccess.thecvf.com
MoCo is effective for unsupervised image representation learning. In this paper, we propose
VideoMoCo for unsupervised video representation learning. Given a video sequence as an …

Self-supervised video representation learning by pace prediction

J Wang, J Jiao, YH Liu - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
This paper addresses the problem of self-supervised video representation learning from a
new perspective–by video pace prediction. It stems from the observation that human visual …

Memory-augmented dense predictive coding for video representation learning

T Han, W **e, A Zisserman - European conference on computer vision, 2020 - Springer
The objective of this paper is self-supervised learning from video, in particular for
representations for action recognition. We make the following contributions:(i) We propose a …

Tcgl: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Tclr: Temporal contrastive learning for video representation

I Dave, R Gupta, MN Rizve, M Shah - Computer Vision and Image …, 2022 - Elsevier
Contrastive learning has nearly closed the gap between supervised and self-supervised
learning of image representations, and has also been explored for videos. However, prior …

STST: Spatial-temporal specialized transformer for skeleton-based action recognition

Y Zhang, B Wu, W Li, L Duan, C Gan - Proceedings of the 29th ACM …, 2021 - dl.acm.org
Skeleton-based action recognition has been widely investigated considering their strong
adaptability to dynamic circumstances and complicated backgrounds. To recognize different …