Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning

NF Chen, H Li - 2016 Asia-Pacific Signal and Information …, 2016 - ieeexplore.ieee.org
This paper reviews the research approaches used in computer-assisted pronunciation
training (CAPT), addresses the existing challenges, and discusses emerging trends and …

CNN-RNN-CTC based end-to-end mispronunciation detection and diagnosis

WK Leung, X Liu, H Meng - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
This paper focuses on using Convolutional Neural Network (CNN), Recurrent Neural
Network (RNN) and Connectionist Temporal Classification (CTC) to build an end-to-end …

An end-to-end mispronunciation detection system for L2 English speech leveraging novel anti-phone modeling

BC Yan, MC Wu, HT Hung, B Chen - arXiv preprint arXiv:2005.11950, 2020 - arxiv.org
Mispronunciation detection and diagnosis (MDD) is a core component of computer-assisted
pronunciation training (CAPT). Most of the existing MDD approaches focus on dealing with …

Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL

NF Chen, D Wee, R Tong, B Ma, H Li - Speech Communication, 2016 - Elsevier
In this work, we analyze phonetic and prosodic pronunciation patterns from iCALL, a speech
corpus designed to evaluate Mandarin mispronunciations by non-native speakers of …

Non-native children speech recognition through transfer learning

M Matassoni, R Gretter, D Falavigna… - … on Acoustics, Speech …, 2018 - ieeexplore.ieee.org
This work deals with non-native children's speech and investigates both multi-task and
transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to …

Improving mispronunciation detection with wav2vec2-based momentum pseudo-labeling for accentedness and intelligibility assessment

M Yang, K Hirschi, SD Looney, O Kang… - arXiv preprint arXiv …, 2022 - arxiv.org
Current leading mispronunciation detection and diagnosis (MDD) systems achieve
promising performance via end-to-end phoneme recognition. One challenge of such end-to …

TLT-school: a corpus of non native children speech

R Gretter, M Matassoni, S Bannò… - arXiv preprint arXiv …, 2020 - arxiv.org
This paper describes "TLT-school", a corpus of speech utterances collected in schools of
northern Italy for assessing the performance of students learning both English and German …

Towards robust mispronunciation detection and diagnosis for L2 English learners with accent-modulating methods

SWF Jiang, BC Yan, TH Lo, FA Chao… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org
With the acceleration of globalization, more and more people are willing or required to learn
second languages (L2). One of the major remaining challenges facing current …

Mispronunciation detection and diagnosis using deep neural networks: a systematic review

M Lounis, B Dendani, H Bahi - Multimedia Tools and Applications, 2024 - Springer
The increased need for foreign language learning, along with advances in speech
technology have heightened interest in computer-assisted pronunciation teaching (CAPT) …

OOD-Speech: A large Bengali speech recognition dataset for out-of-distribution benchmarking

FR Rakib, SS Dip, S Alam, N Tasnim… - arXiv preprint arXiv …, 2023 - arxiv.org
We present OOD-Speech, the first out-of-distribution (OOD) benchmarking dataset for
Bengali automatic speech recognition (ASR). Being one of the most spoken languages …