Creating speaker independent ASR system through prosody modification based data augmentation
In this paper, the effect of prosody-modification-based data augmentation is explored in the
context of automatic speech recognition (ASR). The primary motive is to develop ASR …
context of automatic speech recognition (ASR). The primary motive is to develop ASR …
[HTML][HTML] A formant modification method for improved ASR of children's speech
Differences in acoustic characteristics between children's and adults' speech degrade
performance of automatic speech recognition systems when systems trained using adults' …
performance of automatic speech recognition systems when systems trained using adults' …
A text-to-speech pipeline, evaluation methodology, and initial fine-tuning results for child speech synthesis
Speech synthesis has come a long way as current text-to-speech (TTS) models can now
generate natural human-sounding speech. However, most of the TTS research focuses on …
generate natural human-sounding speech. However, most of the TTS research focuses on …
Addressing noise and pitch sensitivity of speech recognition system through variational mode decomposition based spectral smoothing
In this paper, we propose a novel front-end speech parameterization technique for automatic
speech recognition (ASR) that is less sensitive towards ambient noise and pitch variations …
speech recognition (ASR) that is less sensitive towards ambient noise and pitch variations …
In-domain and out-of-domain data augmentation to improve children's speaker verification system in limited data scenario
In this paper, we present our efforts towards develo** a robust automatic speaker
verification (ASV) system for children when the domain-specific data is limited. For that …
verification (ASV) system for children when the domain-specific data is limited. For that …
Creating robust children's ASR system in zero-resource condition through out-of-domain data augmentation
Develo** an automatic speech recognition (ASR) system for children's speech is
extremely challenging due to the unavailability of data from the child domain for the majority …
extremely challenging due to the unavailability of data from the child domain for the majority …
Spectral war** and data augmentation for low resource language ASR system under mismatched conditions
The performance of an Automatic Speech Recognition System (ASR) system deteriorates
while using it on children speech, due to large variations and mismatch of acoustic and …
while using it on children speech, due to large variations and mismatch of acoustic and …
Children's speaker verification in low and zero resource conditions
Our efforts towards develo** an automatic speaker verification (ASV) system for child
speakers are presented in this paper. For the majority of the languages, children's speech …
speakers are presented in this paper. For the majority of the languages, children's speech …
Develo** children's ASR system under low-resource conditions using end-to-end architecture
S Shahnawazuddin - Digital Signal Processing, 2024 - Elsevier
The work presented in this paper aims at enhancing the performance of end-to-end (E2E)
speech recognition task for children's speech under low resource conditions. For majority of …
speech recognition task for children's speech under low resource conditions. For majority of …
ChildAugment: Data augmentation methods for zero-resource children's speaker verification
The accuracy of modern automatic speaker verification (ASV) systems, when trained
exclusively on adult data, drops substantially when applied to children's speech. The …
exclusively on adult data, drops substantially when applied to children's speech. The …