- Academic Search

Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Cited by 401

Audio self-supervised learning: A survey

S Liu, A Mallol-Ragolta, E Parada-Cabaleiro, K Qian… - Patterns, 2022 - cell.com

Similar to humans' cognitive ability to generalize knowledge and skills, self-supervised
learning (SSL) targets discovering general representations from large-scale data. This …

Cited by 127

wav2vec: Unsupervised pre-training for speech recognition

S Schneider, A Baevski, R Collobert, M Auli - arxiv preprint arxiv …, 2019 - arxiv.org

We explore unsupervised pre-training for speech recognition by learning representations of
raw audio. wav2vec is trained on large amounts of unlabeled audio data and the resulting …

Cited by 1727

Unsupervised speech recognition

A Baevski, WN Hsu, A Conneau… - Advances in Neural …, 2021 - proceedings.neurips.cc

Despite rapid progress in the recent past, current speech recognition systems still require
labeled training data which limits this technology to a small fraction of the languages spoken …

Cited by 331

Unsupervised speech representation learning using wavenet autoencoders

J Chorowski, RJ Weiss, S Bengio… - … /ACM transactions on …, 2019 - ieeexplore.ieee.org

We consider the task of unsupervised extraction of meaningful latent representations of
speech by applying autoencoding neural networks to speech waveforms. The goal is to …

Cited by 414

Deep partial multi-view learning

C Zhang, Y Cui, Z Han, JT Zhou… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

Although multi-view learning has made significant progress over the past few decades, it is
still challenging due to the difficulty in modeling complex correlations among different views …

Cited by 247

Libri-light: A benchmark for asr with limited or no supervision

J Kahn, M Riviere, W Zheng… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

We introduce a new collection of spoken English audio suitable for training speech
recognition systems under limited or no supervision. It is derived from open-source audio …

Cited by 722

Unified speech-text pre-training for speech translation and recognition

Y Tang, H Gong, N Dong, C Wang, WN Hsu… - arxiv preprint arxiv …, 2022 - arxiv.org

We describe a method to jointly pre-train speech and text in an encoder-decoder modeling
framework for speech translation and recognition. The proposed method incorporates four …

Cited by 83

Towards end-to-end unsupervised speech recognition

AH Liu, WN Hsu, M Auli… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

Unsupervised speech recognition has shown great potential to make Automatic Speech
Recognition (ASR) systems accessible to every language. However, existing methods still …

Cited by 81

Firerisk: A remote sensing dataset for fire risk assessment with benchmarks using supervised and self-supervised learning

S Shen, S Seneviratne, X Wanyan… - … Conference on Digital …, 2023 - ieeexplore.ieee.org

In recent decades, wildfires have caused tremendous property losses, fatalities, and
extensive damage to forest ecosystems. Inspired by the abundance of publicly available …

Cited by 367

Saved to My library

Unsupervised cross-modal alignment of speech and text embedding spaces

Self-supervised speech representation learning: A review
