Speech and speaker recognition using raw waveform modeling for adult and children's speech: A comprehensive review

K Radha, M Bansal, RB Pachori - Engineering Applications of Artificial …, 2024 - Elsevier
Conventionally, the extraction of hand-crafted acoustic features has been separated from the
task of establishing robust machine-learning models in speech processing. The manual …

speechocean762: An open-source non-native english speech corpus for pronunciation assessment

J Zhang, Z Zhang, Y Wang, Z Yan, Q Song… - arxiv preprint arxiv …, 2021 - arxiv.org
This paper introduces a new open-source speech corpus named" speechocean762"
designed for pronunciation assessment use, consisting of 5000 English utterances from 250 …

Automatic Pronunciation Assessment--A Review

YE Kheir, A Ali, SA Chowdhury - arxiv preprint arxiv:2310.13974, 2023 - arxiv.org
Pronunciation assessment and its application in computer-aided pronunciation training
(CAPT) have seen impressive progress in recent years. With the rapid growth in language …

Proficiency assessment of L2 spoken English using wav2vec 2.0

S Bannò, M Matassoni - 2022 IEEE Spoken Language …, 2023 - ieeexplore.ieee.org
The increasing demand for learning English as a second language has led to a growing
interest in methods for automatically assessing spoken language proficiency. Most …

[HTML][HTML] Building educational technologies for code-switching: Current practices, difficulties and future directions

L Nguyen, Z Yuan, G Seed - Languages, 2022 - mdpi.com
Code-switching (CSW) is the phenomenon where speakers use two or more languages in a
single discourse or utterance—an increasingly recognised natural product of multilingualism …

[PDF][PDF] kidsTALC: A Corpus of 3-to 11-year-old German Children's Connected Natural Speech.

L Rumberg, C Gebauer, H Ehlert, M Wallbaum… - …, 2022 - researchgate.net
In this paper we present kidsTALC an audio dataset with orthographic and phonetic
transcriptions of German children's speech collected to facilitate the development of speech …

Improving end-to-end models for children's speech recognition

T Patel, O Scharenborg - Applied Sciences, 2024 - mdpi.com
Children's Speech Recognition (CSR) is a challenging task due to the high variability in
children's speech patterns and limited amount of available annotated children's speech …

Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency

S Bannò, M Matassoni - Speech Communication, 2024 - Elsevier
In an interconnected world where English has become the lingua franca of culture,
entertainment, business, and academia, the growing demand for learning English as a …

Error-preserving Automatic Speech Recognition of Young English Learners' Language

J Michot, M Hürlimann, J Deriu, L Sauer… - arxiv preprint arxiv …, 2024 - arxiv.org
One of the central skills that language learners need to practice is speaking the language.
Currently, students in school do not get enough speaking opportunities and lack …

Data augmentation using cyclegan for end-to-end children asr

DK Singh, PP Amin, HB Sailor… - 2021 29th European …, 2021 - ieeexplore.ieee.org
Recent deep learning algorithms are known to perform better for Automatic Speech
Recognition (ASR) of adult speakers, however, yet remains a challenge to recognize …