- Academic Search

A Conneau, M Ma, S Khanuja, Y Zhang… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of
Speech benchmark. FLEURS is an n-way parallel speech dataset in 102 languages built on …

Speichern Zitieren Zitiert von: 279 Ähnliche Artikel Alle 6 Versionen

[Free GPT-4]

[HTML] sciencedirect.com

[HTML][HTML] Speech and language processing with deep learning for dementia diagnosis: A systematic review

M Shi, G Cheung, SR Shahamiri - Psychiatry Research, 2023 - Elsevier

Dementia is a progressive neurodegenerative disease that burdens the person living with
the disease, their families, and medical and social services. Timely diagnosis of dementia …

Speichern Zitieren Zitiert von: 17 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] arxiv.org

Real-time neural radiance talking portrait synthesis via audio-spatial decomposition

J Tang, K Wang, H Zhou, X Chen, D He, T Hu… - ar** the
content unchanged. In this paper, we focus on self-supervised representation learning for …

Speichern Zitieren Zitiert von: 120 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] arxiv.org

Simple and effective zero-shot cross-lingual phoneme recognition

Q Xu, A Baevski, M Auli - arxiv preprint arxiv:2109.11680, 2021 - arxiv.org

Recent progress in self-training, self-supervised pretraining and unsupervised learning
enabled well performing speech recognition systems without any labeled data. However, in …

Speichern Zitieren Zitiert von: 92 Ähnliche Artikel Alle 6 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Improving massively multilingual asr with auxiliary ctc objectives

W Chen, B Yan, J Shi, Y Peng, S Maiti… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Multilingual Automatic Speech Recognition (ASR) models have extended the usability of
speech technologies to a wide variety of languages. With how many languages these …

Speichern Zitieren Zitiert von: 43 Ähnliche Artikel Alle 5 Versionen

[Free GPT-4]

[PDF] arxiv.org

Language ID in the wild: Unexpected challenges on the path to a thousand-language web text corpus

I Caswell, T Breiner, D Van Esch, A Bapna - arxiv preprint arxiv …, 2020 - arxiv.org

Large text corpora are increasingly important for a wide variety of Natural Language
Processing (NLP) tasks, and automatic language identification (LangID) is a core technology …

Speichern Zitieren Zitiert von: 91 Ähnliche Artikel Alle 7 Versionen HTML-Version

[Free GPT-4]

[PDF] mdpi.com

Multilingual speech recognition for Turkic languages

S Mussakhojayeva, K Dauletbek, R Yeshpanov… - Information, 2023 - mdpi.com

The primary aim of this study was to contribute to the development of multilingual automatic
speech recognition for lower-resourced Turkic languages. Ten languages—Azerbaijani …

Speichern Zitieren Zitiert von: 27 Ähnliche Artikel Alle 3 Versionen Im Cache

[Free GPT-4]

[PDF] arxiv.org

ASR2K: Speech recognition for around 2000 languages without audio

X Li, F Metze, DR Mortensen, AW Black… - arxiv preprint arxiv …, 2022 - arxiv.org

Most recent speech recognition models rely on large supervised datasets, which are
unavailable for many low-resource languages. In this work, we present a speech recognition …

Speichern Zitieren Zitiert von: 24 Ähnliche Artikel Alle 9 Versionen HTML-Version

[Free GPT-4]

[PDF] isca-archive.org

[PDF][PDF] Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.

S Khare, AR Mittal, A Diwan, S Sarawagi, P Jyothi… - Interspeech, 2021 - isca-archive.org

Cross-lingual transfer of knowledge from high-resource languages to low-resource
languages is an important research problem in automatic speech recognition (ASR). We …

Speichern Zitieren Zitiert von: 45 Ähnliche Artikel Alle 7 Versionen HTML-Version

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Universal phone recognition with a multilingual allophone system

Fleurs: Few-shot learning evaluation of universal representations of speech

[HTML][HTML] Speech and language processing with deep learning for dementia diagnosis: A systematic review

Real-time neural radiance talking portrait synthesis via audio-spatial decomposition

Simple and effective zero-shot cross-lingual phoneme recognition

Improving massively multilingual asr with auxiliary ctc objectives

Language ID in the wild: Unexpected challenges on the path to a thousand-language web text corpus

Multilingual speech recognition for Turkic languages

ASR2K: Speech recognition for around 2000 languages without audio

[PDF][PDF] Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration.