Improving massively multilingual ASR with auxiliary CTC objectives
Multilingual Automatic Speech Recognition (ASR) models have extended the usability of
speech technologies to a wide variety of languages. With how many languages these …
Streaming end-to-end multilingual speech recognition with joint language identification
Language identification is critical for many downstream tasks in automatic speech
recognition (ASR), and is beneficial to integrate into multilingual end-to-end ASR as an …
Aligning speech to languages to enhance code-switching speech recognition
Code-switching (CS) refers to the switching of languages within a speech signal and results
in language confusion for automatic speech recognition (ASR). To address language …
Language-specific characteristic assistance for code-switching speech recognition
Dual-encoder structure successfully utilizes two language-specific encoders (LSEs) for code-
switching speech recognition. Because LSEs are initialized by two pre-trained language …
LAE: Language-aware encoder for monolingual and multilingual ASR
Despite the rapid progress in automatic speech recognition (ASR) research, recognizing
multilingual speech using a unified ASR system remains highly challenging. Previous works …
Language-routing mixture of experts for multilingual and code-switching speech recognition
W Wang, G Ma, Y Li, B Du - arXiv preprint arXiv:2307.05956, 2023 - arxiv.org
Multilingual speech recognition for both monolingual and code-switching speech is a
challenging task. Recently, based on the Mixture of Experts (MoE), many works have made …
Enhancing code-switching speech recognition with interactive language biases
Languages usually switch within a multilingual speech signal, especially in a bilingual
society. This phenomenon is referred to as code-switching (CS), making automatic speech …
Towards zero-shot code-switched speech recognition
In this work, we seek to build effective code-switched (CS) automatic speech recognition
systems (ASR) under the zero-shot setting where no transcribed CS speech data is …
Adapting OpenAI's Whisper for speech recognition on code-switch Mandarin-English SEAME and ASRU2019 datasets
This paper reports on SOTA results achieved using OpenAI's Whisper model with adaptation
on different adaptation corpus sizes for two established code-switch Mandarin/English …
Internal language model estimation based language model fusion for cross-domain code-switching speech recognition
Internal Language Model Estimation (ILME) based language model (LM) fusion has been
shown significantly improved recognition results over conventional shallow fusion in both …