Multilingual and code-switching ASR challenges for low resource Indian languages

A Diwan, R Vaideeswaran, S Shah, A Singh… - arXiv preprint arXiv …, 2021 - arxiv.org
Recently, there is increasing interest in multilingual automatic speech recognition (ASR)
where a speech recognition system caters to multiple low resource languages by taking …

Automatic Speech Recognition: A survey of deep learning techniques and approaches

H Ahlawat, N Aggarwal, D Gupta - International Journal of Cognitive …, 2025 - Elsevier
Significant research has been conducted during the last decade on the application of
machine learning for speech processing, particularly speech recognition. However, in recent …

Open-source multi-speaker speech corpora for building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu speech synthesis systems

F He, SHC Chu, O Kjartansson, C Rivera… - Proceedings of the …, 2020 - aclanthology.org
We present free high quality multi-speaker speech corpora for Gujarati, Kannada,
Malayalam, Marathi, Tamil and Telugu, which are six of the twenty two official languages of …

A survey of multilingual models for automatic speech recognition

H Yadav, S Sitaram - arXiv preprint arXiv:2202.12576, 2022 - arxiv.org
Although Automatic Speech Recognition (ASR) systems have achieved human-like
performance for a few languages, the majority of the world's languages do not have usable …

Towards building ASR systems for the next billion users

T Javed, S Doddapaneni, A Raman… - Proceedings of the …, 2022 - ojs.aaai.org
Recent methods in speech and language technology pretrain very large models which are
fine-tuned for specific tasks. However, the benefits of such large models are often limited to a …

Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration

S Khare, AR Mittal, A Diwan, S Sarawagi, P Jyothi… - Interspeech, 2021 - isca-archive.org
Cross-lingual transfer of knowledge from high-resource languages to low-resource
languages is an important research problem in automatic speech recognition (ASR). We …

IndicSUPERB: A speech processing universal performance benchmark for Indian languages

T Javed, K Bhogale, A Raman, P Kumar… - Proceedings of the …, 2023 - ojs.aaai.org
A cornerstone in AI research has been the creation and adoption of standardized training
and test datasets to earmark the progress of state-of-the-art models. A particularly successful …

IndicVoices: Towards building an inclusive multilingual speech dataset for Indian languages

T Javed, JA Nawale, EI George, S Joshi… - arXiv preprint arXiv …, 2024 - arxiv.org
We present INDICVOICES, a dataset of natural and spontaneous speech containing a total
of 7348 hours of read (9%), extempore (74%) and conversational (17%) audio from 16237 …

End-to-End Speech Recognition of Tamil Language

MH Changrampadi, A Shahina… - … Automation & Soft …, 2022 - researchgate.net
Research in speech recognition is progressing with numerous state-of-the-art results in
recent times. However, relatively less research is being carried out in Automatic Speech …

Automatic speech recognition in Sanskrit: A new speech corpus and modelling insights

D Adiga, R Kumar, A Krishna, P Jyothi… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic speech recognition (ASR) in Sanskrit is interesting, owing to the various linguistic
peculiarities present in the language. The Sanskrit language is lexically productive …