A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

[PDF][PDF] Deep unsupervised domain adaptation: A review of recent advances and perspectives

X Liu, C Yoo, F **ng, H Oh, G El Fakhri… - … on Signal and …, 2022 - nowpublishers.com
Deep learning has become the method of choice to tackle real-world problems in different
domains, partly because of its ability to learn from data and achieve impressive performance …

Domain adaptation: challenges, methods, datasets, and applications

P Singhal, R Walambe, S Ramanna, K Kotecha - IEEE access, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) trained on one dataset (source domain) do not perform well
on another set of data (target domain), which is different but has similar properties as the …

Pada: Pruning assisted domain adaptation for self-supervised speech representations

VS Lodagala, S Ghosh, S Umesh - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
While self-supervised speech representation learning (SSL) models serve a variety of
downstream tasks, these models have been observed to overfit to the domain from which the …

Boosting cross-domain speech recognition with self-supervision

H Zhu, G Cheng, J Wang, W Hou… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
The cross-domain performance of automatic speech recognition (ASR) could be severely
hampered due to the mismatch between training and testing distributions. Since the target …

Sample-efficient unsupervised domain adaptation of speech recognition systems: A case study for modern greek

G Paraskevopoulos, T Kouzelis… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org
Modern speech recognition systems exhibit rapid performance degradation under domain
shift. This issue is especially prevalent in data-scarce settings, such as low-resource …

Meta-learning for Indian languages: Performance analysis and improvements with linguistic similarity measures

CS Anoop, AG Ramakrishnan - IEEE Access, 2023 - ieeexplore.ieee.org
Indian languages share a lot of overlap in acoustic and linguistic content. Though different
languages use different writing systems, the phoneme sets logically overlap. Most of these …

Code-Switching ASR for Low-Resource Indic Languages: A Hindi-Marathi Case Study

H Palivela, M Narvekar, D Asirvatham, S Bhusan… - IEEE …, 2025 - ieeexplore.ieee.org
This work examines the development of Automatic Speech Recognition (ASR) systems for
low-resource languages, focusing on Hindi and Marathi, particularly in multilingual and code …

Investigation of different G2P schemes for speech recognition in Sanskrit

CS Anoop, AG Ramakrishnan - … 2021, Sanur, Bali, Indonesia, December 8 …, 2021 - Springer
In this work, we explore the impact of different grapheme to phoneme (G2P) conversion
schemes for the task of automatic speech recognition (ASR) in Sanskrit. The performance of …

[HTML][HTML] Unsupervised adaptation of deep speech activity detection models to unseen domains

P Gimeno, D Ribas, A Ortega, A Miguel, E Lleida - Applied Sciences, 2022 - mdpi.com
Speech Activity Detection (SAD) aims to accurately classify audio fragments containing
human speech. Current state-of-the-art systems for the SAD task are mainly based on deep …