Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition

RN Nandi, MH Menon, TA Muntasir, S Sarker… - ar** automatic speech recognition (ASR) for low-
resource languages is the limited access to labeled data with domain-specific variations. In …

Revisiting automatic speech recognition for tamil and hindi connected number recognition

R Mishra, SRG Boopathy, M Ravikiran… - Proceedings of the …, 2023 - aclanthology.org
Abstract Automatic Speech Recognition and its applications are rising in popularity across
applications with reasonable inference results. Recent state-of-the-art approaches, often …

Exploring Energy-Based Models for Out-of-Distribution Detection in Dialect Identification

Y Hao, C Hu, Y Gao, S Zhang, J Feng - arxiv preprint arxiv:2406.18067, 2024 - arxiv.org
The diverse nature of dialects presents challenges for models trained on specific linguistic
patterns, rendering them susceptible to errors when confronted with unseen or out-of …

A Comprehensive Study on Bangla Automatic Speech Recognition Systems

P Paul, MM Bouh, F Hossain… - 2023 2nd International …, 2023 - ieeexplore.ieee.org
This article examines the current state of speech recognition technologies in Bangla, which
is one of the Low Resource Languages (LRLs) spoken by a population of almost 185 million …

Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

AM Samin - arxiv preprint arxiv:2401.15532, 2024 - arxiv.org
Byte pair encoding (BPE) emerges as an effective tokenization method for tackling the out-of-
vocabulary (OOV) challenge in various natural language and speech processing tasks …

Fine-tuning convergence model in Bengali speech recognition

Z Ruiying, S Meng - arxiv preprint arxiv:2311.04122, 2023 - arxiv.org
Research on speech recognition has attracted considerable interest due to the difficult task
of segmenting uninterrupted speech. Among various languages, Bengali features distinct …

Speech Recognition Using Adaptation of Whisper Models

V Tyagi, A Dev, P Bansal - International Conference on Artificial …, 2023 - Springer
The application of several deep-learning approaches has led to remarkable improvements
in automatic speech recognition (ASR). In this paper, the authors transcribed recordings of …

[PDF][PDF] A character gram modeling approach towards Bengali Speech to Text with Regional Dialects

MR Hassan - 2023 - researchgate.net
The Bengali language, spoken in various regions of south-Asia and also among the Bengali
diaspora, exhibits rich diversity with regional dialects or variations that reflect the cultural …

[PDF][PDF] Towards Inclusive Digital Health: An Architecture to Extract Health Information from Patients with Low-Resource Language.

P Paul, MM Bouh, A Ahmed - BIOSTEC (2), 2024 - scitepress.org
Collection of health information from the underserved community has been a challenge.
Their health records are not digitized. The major population of the underserved community is …

Enhancing Bangla speech recognition through acoustic and language modeling

R Anam, A Tabassum, SA Arnob, MT Khan - 2024 - dspace.bracu.ac.bd
This work introduces the performance comparison of two deep ASR models, Wav2Vec2 and
Whisper, in recognizing the Bengali language with complex linguistics, dialectal variations …