Interaction between people with dysarthria and speech recognition systems: A review

A Jaddoh, F Loizides, O Rana - Assistive Technology, 2023 - Taylor & Francis
In recent years, rapid advancements have taken place for automatic speech recognition
(ASR) systems and devices. Though ASR technologies have increased, the accessibility of …

Recent advancements in automatic disordered speech recognition: A survey paper

N Gohider, OA Basir - Natural Language Processing Journal, 2024 - Elsevier
Automatic Speech Recognition technology (ASR) has recently witnessed a
paradigm shift with respect to performance accuracy. Nevertheless, impaired speech …

Speaker adaptation for Wav2vec2 based dysarthric ASR

MK Baskar, T Herzig, D Nguyen, M Diez… - arXiv preprint arXiv …, 2022 - arxiv.org
Dysarthric speech recognition has posed major challenges due to lack of training data and
heavy mismatch in speaker characteristics. Recent ASR systems have benefited from readily …

Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition

M Geng, X Xie, Z Ye, T Wang, G Li, S Hu… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …

Acoustic modelling from raw source and filter components for dysarthric speech recognition

Z Yue, E Loweimi, H Christensen… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Acoustic modelling for automatic dysarthric speech recognition (ADSR) is a challenging
task. Data deficiency is a major problem and substantial differences between typical and …

Hierarchical multi-class classification of voice disorders using self-supervised models and glottal features

S Tirronen, SR Kadiri, P Alku - IEEE Open Journal of Signal …, 2023 - ieeexplore.ieee.org
Previous studies on the automatic classification of voice disorders have mostly investigated
the binary classification task, which aims to distinguish pathological voice from healthy …

Adversarial data augmentation using VAE-GAN for disordered speech recognition

Z Jin, X Xie, M Geng, T Wang, S Hu… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …

Self-supervised ASR models and features for dysarthric and elderly speech recognition

S Hu, X Xie, M Geng, Z Jin, J Deng, G Li… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …

DuTa-VC: A duration-aware typical-to-atypical voice conversion approach with diffusion probabilistic model

H Wang, T Thebaud, J Villalba, M Sydnor… - arXiv preprint arXiv …, 2023 - arxiv.org
We present a novel typical-to-atypical voice conversion approach (DuTa-VC), which (i) can
be trained with nonparallel data (ii) first introduces diffusion probabilistic model (iii) …

Towards identity preserving normal to dysarthric voice conversion

WC Huang, BM Halpern, LP Violeta… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
We present a voice conversion framework that converts normal speech into dysarthric
speech while preserving the speaker identity. Such a framework is essential for (1) clinical …