Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers

W Hu, Y Qian, FK Soong, Y Wang - Speech Communication, 2015 - Elsevier
Mispronunciation detection is an important part in a Computer-Aided Language Learning
(CALL) system. By automatically pointing out where mispronunciations occur in an …

The automatic detection of speech disorders in children: Challenges, opportunities, and preliminary results

M Shahin, U Zafar, B Ahmed - IEEE Journal of Selected Topics …, 2019 - ieeexplore.ieee.org
Given the limited accessibility to Speech and Language Pathologists (SLPs) children in
need often have, pediatric Computer-Aided Speech Therapy (CAST) tools can play an …

Mispronunciation detection and diagnosis in L2 English speech using multidistribution deep neural networks

K Li, X Qian, H Meng - IEEE/ACM Transactions on Audio …, 2016 - ieeexplore.ieee.org
This paper investigates the use of multidistribution deep neural networks (DNNs) for
mispronunciation detection and diagnosis (MDD), to circumvent the difficulties encountered …

Maximum F1-score discriminative training criterion for automatic mispronunciation detection

H Huang, H Xu, X Wang… - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
We carry out an in-depth investigation on a newly proposed Maximum F1-score Criterion
(MFC) discriminative training objective function for Goodness of Pronunciation (GOP) based …

Context-aware goodness of pronunciation for computer-assisted pronunciation training

J Shi, N Huo, Q Jin - arXiv preprint arXiv:2008.08647, 2020 - arxiv.org
Mispronunciation detection is an essential component of the Computer-Assisted
Pronunciation Training (CAPT) systems. State-of-the-art mispronunciation detection models …

A Study on Fine-Tuning wav2vec2.0 Model for the Task of Mispronunciation Detection and Diagnosis

L Peng, K Fu, B Lin, D Ke, J Zhang - Interspeech, 2021 - isca-archive.org
Mispronunciation detection and diagnosis (MDD) technology is a key component of
computer-assisted pronunciation training system (CAPT). The mainstream method is based …

Automatic Pronunciation Assessment--A Review

YE Kheir, A Ali, SA Chowdhury - arXiv preprint arXiv:2310.13974, 2023 - arxiv.org
Pronunciation assessment and its application in computer-aided pronunciation training
(CAPT) have seen impressive progress in recent years. With the rapid growth in language …

An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities

S Sudhakara, MK Ramanathi, C Yarra, PK Ghosh - INTERSPEECH, 2019 - academia.edu
Goodness of pronunciation (GoP) is typically formulated with Gaussian mixture model-
hidden Markov model (GMM-HMM) based acoustic models considering HMM state transition …

End-to-end neural network based automated speech scoring

L Chen, J Tao, S Ghaffarzadegan… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
In recent years, machine learning models for automated speech scoring systems were
mainly built using data-driven approaches with handcrafted features as one of the main …

End-to-end automatic pronunciation error detection based on improved hybrid CTC/attention architecture

L Zhang, Z Zhao, C Ma, L Shan, H Sun, L Jiang… - Sensors, 2020 - mdpi.com
Advanced automatic pronunciation error detection (APED) algorithms are usually based on
state-of-the-art automatic speech recognition (ASR) techniques. With the development of …