A survey on self-supervised learning: Algorithms, applications, and future trends

J Gui, T Chen, J Zhang, Q Cao, Z Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep supervised learning algorithms typically require a large volume of labeled data to
achieve satisfactory performance. However, the process of collecting and labeling such data …

Self-supervised learning for videos: A survey

MC Schiappa, YS Rawat, M Shah - ACM Computing Surveys, 2023 - dl.acm.org
The remarkable success of deep learning in various domains relies on the availability of
large-scale annotated datasets. However, obtaining annotations is expensive and requires …

Verbs in action: Improving verb understanding in video-language models

L Momeni, M Caron, A Nagrani… - Proceedings of the …, 2023 - openaccess.thecvf.com
Understanding verbs is crucial to modelling how people and objects interact with each other
and the environment through space and time. Recently, state-of-the-art video-language …

Spatiotemporal contrastive video representation learning

R Qian, T Meng, B Gong, MH Yang… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a self-supervised Contrastive Video Representation Learning (CVRL) method to
learn spatiotemporal visual representations from unlabeled videos. Our representations are …

Videomoco: Contrastive video representation learning with temporally adversarial examples

T Pan, Y Song, T Yang, W Jiang… - Proceedings of the …, 2021 - openaccess.thecvf.com
MoCo is effective for unsupervised image representation learning. In this paper, we propose
VideoMoCo for unsupervised video representation learning. Given a video sequence as an …

CAS(ME)3: A Third Generation Facial Spontaneous Micro-Expression Database With Depth Information and High Ecological Validity

J Li, Z Dong, S Lu, SJ Wang, WJ Yan… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org
Micro-expression (ME) is a significant non-verbal communication clue that reveals one
person's genuine emotional state. The development of micro-expression analysis (MEA) has …

Self-supervised video representation learning by pace prediction

J Wang, J Jiao, YH Liu - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
This paper addresses the problem of self-supervised video representation learning from a
new perspective–by video pace prediction. It stems from the observation that human visual …

Tcgl: Temporal contrastive graph for self-supervised video representation learning

Y Liu, K Wang, L Liu, H Lan, L Lin - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video self-supervised learning is a challenging task, which requires significant expressive
power from the model to leverage rich spatial-temporal knowledge and generate effective …

Tclr: Temporal contrastive learning for video representation

I Dave, R Gupta, MN Rizve, M Shah - Computer Vision and Image …, 2022 - Elsevier
Contrastive learning has nearly closed the gap between supervised and self-supervised
learning of image representations, and has also been explored for videos. However, prior …

A survey on deep learning in medical image registration: New technologies, uncertainty, evaluation metrics, and beyond

J Chen, Y Liu, S Wei, Z Bian, S Subramanian… - Medical Image …, 2024 - Elsevier
Deep learning technologies have dramatically reshaped the field of medical image
registration over the past decade. The initial developments, such as regression-based and U …