Acoustic modeling based on deep learning for low-resource speech recognition: An overview

C Yu, M Kang, Y Chen, J Wu, X Zhao - IEEE Access, 2020 - ieeexplore.ieee.org
The polarization of world languages is becoming more and more obvious. Many languages,
mainly endangered languages, are of low-resource attribute due to lack of information. Both …

Data augmentation via dependency tree morphing for low-resource languages

GG Şahin, M Steedman - arxiv preprint arxiv:1903.09460, 2019 - arxiv.org
Neural NLP systems achieve high scores in the presence of sizable training dataset. Lack of
such datasets leads to poor system performances in the case low-resource languages. We …

Comprehensive literature review on children automatic speech recognition system, acoustic linguistic mismatch approaches and challenges

R Sobti, K Guleria, V Kadyan - Multimedia Tools and Applications, 2024 - Springer
Abstract Automatic Speech Recognition (ASR) system for children is as important as for
adults since children are more dependent on these systems nowadays, such as computer …

Improving data augmentation for low resource speech-to-text translation with diverse paraphrasing

C Mi, L **e, Y Zhang - Neural Networks, 2022 - Elsevier
High quality end-to-end speech translation model relies on a large scale of speech-to-text
training data, which is usually scarce or even unavailable for some low-resource language …

Speaker augmentation for low resource speech recognition

C Du, K Yu - ICASSP 2020-2020 IEEE International Conference …, 2020 - ieeexplore.ieee.org
Text-to-speech synthesis (TTS) has been used as a data augmentation approach for
automatic speech recognition (ASR), leveraging additional texts for ASR training. However …

[PDF][PDF] Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System.

C Kim, M Shin, A Garg, D Gowda - Interspeech, 2019 - researchgate.net
In this paper, we present an improved vocal tract length perturbation (VTLP) algorithm as a
data augmentation technique. VTLP is usually accomplished by adjusting the center …

Automatic speech recognition for Uyghur, Kazakh, and Kyrgyz: An overview

W Du, Y Maimaitiyiming, M Nijat, L Li, A Hamdulla… - Applied Sciences, 2022 - mdpi.com
With the emergence of deep learning, the performance of automatic speech recognition
(ASR) systems has remarkably improved. Especially for resource-rich languages such as …

[PDF][PDF] Data Augmentation Improves Recognition of Foreign Accented Speech.

T Fukuda, R Fernandez, A Rosenberg, S Thomas… - Interspeech, 2018 - isca-archive.org
Speech recognition of foreign accented (non-native or L2) speech remains a challenge to
the state-of-the-art. The most common approach to address this scenario involves the …

In domain training data augmentation on noise robust Punjabi Children speech recognition

V Kadyan, P Bawa, T Hasija - Journal of Ambient Intelligence and …, 2022 - Springer
For building a successful automatic speech recognition (ASR) engine large training data is
required. It increases training complexity and become impossible for less resource language …

Inspection method of the corrosion rate for underwater grouting sleeves by integrating ultrasonic data augmentation and interpretable ensemble learning

W Wang, S Jiang, H Song, H Wu, S Wang - Measurement, 2024 - Elsevier
To addresses the challenge of underwater grouting sleeve corrosion affecting bridge service
life, we propose an innovative inspection approach by integrating a modified ultrasonic …