Interaction between people with dysarthria and speech recognition systems: A review
In recent years, rapid advancements have taken place for automatic speech recognition
(ASR) systems and devices. Though ASR technologies have increased, the accessibility of …
(ASR) systems and devices. Though ASR technologies have increased, the accessibility of …
[HTML][HTML] Recent advancements in automatic disordered speech recognition: A survey paper
N Gohider, OA Basir - Natural Language Processing Journal, 2024 - Elsevier
Abstract Automatic Speech Recognition technology (ASR) has recently witnessed a
paradigm shift with respect to performance accuracy. Nevertheless, impaired speech …
paradigm shift with respect to performance accuracy. Nevertheless, impaired speech …
Speaker adaptation for Wav2vec2 based dysarthric ASR
Dysarthric speech recognition has posed major challenges due to lack of training data and
heavy mismatch in speaker characteristics. Recent ASR systems have benefited from readily …
heavy mismatch in speaker characteristics. Recent ASR systems have benefited from readily …
Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …
Acoustic modelling from raw source and filter components for dysarthric speech recognition
Acoustic modelling for automatic dysarthric speech recognition (ADSR) is a challenging
task. Data deficiency is a major problem and substantial differences between typical and …
task. Data deficiency is a major problem and substantial differences between typical and …
Hierarchical multi-class classification of voice disorders using self-supervised models and glottal features
Previous studies on the automatic classification of voice disorders have mostly investigated
the binary classification task, which aims to distinguish pathological voice from healthy …
the binary classification task, which aims to distinguish pathological voice from healthy …
Adversarial data augmentation using vae-gan for disordered speech recognition
Automatic recognition of disordered speech remains a highly challenging task to date. The
underlying neuro-motor conditions, often compounded with co-occurring physical …
underlying neuro-motor conditions, often compounded with co-occurring physical …
Self-supervised asr models and features for dysarthric and elderly speech recognition
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …
Duta-vc: A duration-aware typical-to-atypical voice conversion approach with diffusion probabilistic model
We present a novel typical-to-atypical voice conversion approach (DuTa-VC), which (i) can
be trained with nonparallel data (ii) first introduces diffusion probabilistic model (iii) …
be trained with nonparallel data (ii) first introduces diffusion probabilistic model (iii) …
Towards identity preserving normal to dysarthric voice conversion
We present a voice conversion framework that converts normal speech into dysarthric
speech while preserving the speaker identity. Such a framework is essential for (1) clinical …
speech while preserving the speaker identity. Such a framework is essential for (1) clinical …