Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Teaching matters: Investigating the role of supervision in vision transformers

M Walmer, S Suri, K Gupta… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Vision Transformers (ViTs) have gained significant popularity in recent years and
have proliferated into many applications. However, their behavior under different learning …

On the stepwise nature of self-supervised learning

JB Simon, M Knutins, L Ziyin, D Geisz… - International …, 2023 - proceedings.mlr.press
We present a simple picture of the training process of self-supervised learning methods with
dual deep networks. In our picture, these methods learn their high-dimensional embeddings …

Understanding cross-domain few-shot learning based on domain similarity and few-shot difficulty

J Oh, S Kim, N Ho, JH Kim, H Song… - Advances in Neural …, 2022 - proceedings.neurips.cc
Cross-domain few-shot learning (CD-FSL) has drawn increasing attention for handling large
differences between the source and target domains--an important concern in real-world …

Convnet vs transformer, supervised vs clip: Beyond imagenet accuracy

K Vishniakov, Z Shen, Z Liu - ar**
FC Ogidi, MG Eramian, I Stavness - Plant Phenomics, 2023 - spj.science.org
The rise of self-supervised learning (SSL) methods in recent years presents an opportunity
to leverage unlabeled and domain-specific datasets generated by image-based plant …