Audio self-supervised learning: A survey

S Liu, A Mallol-Ragolta, E Parada-Cabaleiro, K Qian… - Patterns, 2022 - cell.com
Similar to humans' cognitive ability to generalize knowledge and skills, self-supervised
learning (SSL) targets discovering general representations from large-scale data. This …

WavLM: Large-scale self-supervised pre-training for full stack speech processing

S Chen, C Wang, Z Chen, Y Wu, S Liu… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Self-supervised learning (SSL) achieves great success in speech recognition, while limited
exploration has been attempted for other speech processing tasks. As speech signal …

Comparative layer-wise analysis of self-supervised speech models

A Pasad, B Shi, K Livescu - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Many self-supervised speech models, varying in their pre-training objective, input modality,
and pre-training data, have been proposed in the last few years. Despite impressive …

A survey of reasoning with foundation models

J Sun, C Zheng, E Xie, Z Liu, R Chu, J Qiu, J Xu… - arXiv preprint arXiv …, 2023 - arxiv.org
Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various
real-world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …

ML-SUPERB: Multilingual speech universal performance benchmark

J Shi, D Berrebbi, W Chen, HL Chung, EP Hu… - arXiv preprint arXiv …, 2023 - arxiv.org
Speech processing Universal PERformance Benchmark (SUPERB) is a leaderboard to
benchmark the performance of Self-Supervised Learning (SSL) models on various speech …

A large-scale evaluation of speech foundation models

S Yang, HJ Chang, Z Huang, AT Liu… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
The foundation model paradigm leverages a shared foundation model to achieve
state-of-the-art (SOTA) performance for various tasks, requiring minimal downstream-specific data …

What do self-supervised speech models know about words?

A Pasad, CM Chien, S Settle, K Livescu - Transactions of the …, 2024 - direct.mit.edu
Many self-supervised speech models (S3Ms) have been introduced over the last few years,
improving performance and data efficiency on various speech tasks. However, these …

Advancing large language models to capture varied speaking styles and respond properly in spoken conversations

GT Lin, CH Chiang, H Lee - arXiv preprint arXiv:2402.12786, 2024 - arxiv.org
In spoken dialogue, even if two current turns are the same sentence, their responses might
still differ when they are spoken in different styles. The spoken styles, containing …

On the utility of self-supervised models for prosody-related tasks

GT Lin, CL Feng, WP Huang, Y Tseng… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Self-Supervised Learning (SSL) from speech data has produced models that have achieved
remarkable performance in many tasks, and that are known to implicitly represent many …

Generative pre-training for speech with flow matching

AH Liu, M Le, A Vyas, B Shi, A Tjandra… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative models have gained more and more attention in recent years for their
remarkable success in tasks that required estimating and sampling data distribution to …