- Academic Search

SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative...

S Liu, A Mallol-Ragolta, E Parada-Cabaleiro, K Qian… - Patterns, 2022 - cell.com

Similar to humans' cognitive ability to generalize knowledge and skills, self-supervised
learning (SSL) targets discovering general representations from large-scale data. This …

Save Cite Cited by 127 Related articles All 12 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Wavlm: Large-scale self-supervised pre-training for full stack speech processing

S Chen, C Wang, Z Chen, Y Wu, S Liu… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Self-supervised learning (SSL) achieves great success in speech recognition, while limited
exploration has been attempted for other speech processing tasks. As speech signal …

Save Cite Cited by 1820 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Comparative layer-wise analysis of self-supervised speech models

A Pasad, B Shi, K Livescu - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

Many self-supervised speech models, varying in their pre-training objective, input modality,
and pre-training data, have been proposed in the last few years. Despite impressive …

Save Cite Cited by 112 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Ml-superb: Multilingual speech universal performance benchmark

J Shi, D Berrebbi, W Chen, HL Chung, EP Hu… - arxiv preprint arxiv …, 2023 - arxiv.org

Speech processing Universal PERformance Benchmark (SUPERB) is a leaderboard to
benchmark the performance of Self-Supervised Learning (SSL) models on various speech …

Save Cite Cited by 59 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Generative pre-training for speech with flow matching

AH Liu, M Le, A Vyas, B Shi, A Tjandra… - arxiv preprint arxiv …, 2023 - arxiv.org

Generative models have gained more and more attention in recent years for their
remarkable success in tasks that required estimating and sampling data distribution to …

Save Cite Cited by 25 Related articles All 3 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

A survey of reasoning with foundation models

J Sun, C Zheng, E **e, Z Liu, R Chu, J Qiu, J Xu… - arxiv preprint arxiv …, 2023 - arxiv.org

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …

Save Cite Cited by 39 Related articles All 2 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Speechprompt: An exploration of prompt tuning on generative spoken language model for speech processing tasks

KW Chang, WC Tseng, SW Li, H Lee - arxiv preprint arxiv:2203.16773, 2022 - arxiv.org

Speech representations learned from Self-supervised learning (SSL) models can benefit
various speech processing tasks. However, utilizing SSL representations usually requires …

Save Cite Cited by 54 Related articles All 6 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

Superb@ slt 2022: Challenge on generalization and efficiency of self-supervised speech representation learning

T Feng, A Dong, CF Yeh, S Yang, TQ Lin… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised
speech representation for better performance, generalization, and efficiency. The challenge …

Save Cite Cited by 34 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Speech self-supervised representation benchmarking: Are we doing it right?

S Zaiem, Y Kemiche, T Parcollet, S Essid… - arxiv preprint arxiv …, 2023 - arxiv.org

Self-supervised learning (SSL) has recently allowed leveraging large datasets of unlabeled
speech signals to reach impressive performance on speech tasks using only small amounts …

Save Cite Cited by 32 Related articles All 8 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] arxiv.org

On the utility of self-supervised models for prosody-related tasks

GT Lin, CL Feng, WP Huang, Y Tseng… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

Self-Supervised Learning (SSL) from speech data has produced models that have achieved
remarkable performance in many tasks, and that are known to implicitly represent many …

Save Cite Cited by 47 Related articles All 4 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative...

Audio self-supervised learning: A survey

Wavlm: Large-scale self-supervised pre-training for full stack speech processing

Comparative layer-wise analysis of self-supervised speech models

Ml-superb: Multilingual speech universal performance benchmark

Generative pre-training for speech with flow matching

A survey of reasoning with foundation models

Speechprompt: An exploration of prompt tuning on generative spoken language model for speech processing tasks

Superb@ slt 2022: Challenge on generalization and efficiency of self-supervised speech representation learning

Speech self-supervised representation benchmarking: Are we doing it right?

On the utility of self-supervised models for prosody-related tasks