Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning
This paper reviews the research approaches used in computer-assisted pronunciation
training (CAPT), addresses the existing challenges, and discusses emerging trends and …
CNN-RNN-CTC based end-to-end mispronunciation detection and diagnosis
This paper focuses on using Convolutional Neural Network (CNN), Recurrent Neural
Network (RNN) and Connectionist Temporal Classification (CTC) to build an end-to-end …
An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling
Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …
Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL
In this work, we analyze phonetic and prosodic pronunciation patterns from iCALL, a speech
corpus designed to evaluate Mandarin mispronunciations by non-native speakers of …
Non-native children speech recognition through transfer learning
This work deals with non-native children's speech and investigates both multi-task and
transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to …
Improving mispronunciation detection with wav2vec2-based momentum pseudo-labeling for accentedness and intelligibility assessment
Current leading mispronunciation detection and diagnosis (MDD) systems achieve
promising performance via end-to-end phoneme recognition. One challenge of such end-to …
TLT-school: a corpus of non native children speech
This paper describes "TLT-school", a corpus of speech utterances collected in schools of
northern Italy for assessing the performance of students learning both English and German …
Towards robust mispronunciation detection and diagnosis for L2 English learners with accent-modulating methods
With the acceleration of globalization, more and more people are willing or required to learn
second languages (L2). One of the major remaining challenges facing current …
Mispronunciation detection and diagnosis using deep neural networks: a systematic review
The increased need for foreign language learning, along with advances in speech
technology have heightened interest in computer-assisted pronunciation teaching (CAPT) …
OOD-Speech: A large Bengali speech recognition dataset for out-of-distribution benchmarking
We present OOD-Speech, the first out-of-distribution (OOD) benchmarking dataset for
Bengali automatic speech recognition (ASR). Being one of the most spoken languages …