[HTML][HTML] Computer-assisted pronunciation training—Speech synthesis is almost all you need

D Korzekwa, J Lorenzo-Trueba, T Drugman… - Speech …, 2022 - Elsevier
The research community has long studied computer-assisted pronunciation training (CAPT)
methods in non-native speech. Researchers focused on studying various model …

Weakly-supervised word-level pronunciation error detection in non-native English speech

D Korzekwa, J Lorenzo-Trueba, T Drugman… - arxiv preprint arxiv …, 2021 - arxiv.org
We propose a weakly-supervised model for word-level mispronunciation detection in non-
native (L2) English speech. To train this model, phonetically transcribed L2 speech is not …

Bayes risk ctc: Controllable ctc alignment in sequence-to-sequence tasks

J Tian, B Yan, J Yu, C Weng, D Yu… - arxiv preprint arxiv …, 2022 - arxiv.org
Sequence-to-Sequence (seq2seq) tasks transcribe the input sequence to a target sequence.
The Connectionist Temporal Classification (CTC) criterion is widely used in multiple …

End-to-end word-level disfluency detection and classification in children's reading assessment

L Venkatasubramaniam, V Sunder… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Disfluency detection and classification on children's speech has a great potential for
teaching reading skills. Word-level assessment of children's speech can help teachers to …

Perceptual loss with recognition model for single-channel enhancement and robust ASR

P Plantinga, D Bagchi, E Fosler-Lussier - arxiv preprint arxiv:2112.06068, 2021 - arxiv.org
Single-channel speech enhancement approaches do not always improve automatic
recognition rates in the presence of noise, because they can introduce distortions unhelpful …

'Beach'to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube

K Ramesh, AR KhudaBukhsh, S Kumar - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Over the last few years, YouTube Kids has emerged as one of the highly competitive
alternatives to television for children's entertainment. Consequently, YouTube Kids' content …

End-To-End Real Time Tracking of Children's Reading with Pointer Network

V Sunder, B Karrolla… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
In this work, we explore how a real time reading tracker can be built efficiently for children's
voices. While previously proposed reading trackers focused on ASR-based cascaded …

[PDF][PDF] Orthography-based pronunciation scoring for better CAPT feedback

C Richter, R Pálsson, L O'Brien, K Friðriksdóttir… - Proc. Interspeech …, 2023 - catir.github.io
We establish the viability of a streamlined architecture for pedagogically appropriate
computer assisted pronunciation training (CAPT), to give second language learners …

A Computational Account of Selected Patterns of Linguistic Variation and Change

J Zhu - 2022 - deepblue.lib.umich.edu
Language variation and change are ubiquitous, and one aim of linguistic research is to
understand synchronic variation and how it contributes to change over time. This …

[HTML][HTML] Automated detection of pronunciation errors in non-native English speech employing deep learning

D Korzekwa, B Kostek, R Barra-Chicote - 2022 - amazon.science
Download Copy BibTeX@ Article {Korzekwa2022, author={Daniel Korzekwa and Bozena
Kostek and Roberto Barra-Chicote}, title={Automated detection of pronunciation errors in …