Characterizing dysarthria diversity for automatic speech recognition: A tutorial from the clinical perspective

HP Rowe, SE Gutz, MF Maffei, K Tomanek… - Frontiers in computer …, 2022 - frontiersin.org
Despite significant advancements in automatic speech recognition (ASR) technology, even
the best performing ASR systems are inadequate for speakers with impaired speech. This …

Interaction between people with dysarthria and speech recognition systems: A review

A Jaddoh, F Loizides, O Rana - Assistive Technology, 2023 - Taylor & Francis
In recent years, rapid advancements have taken place for automatic speech recognition
(ASR) systems and devices. Though ASR technologies have increased, the accessibility of …

Exploring self-supervised pre-trained asr models for dysarthric and elderly speech recognition

S Hu, X **e, Z **, M Geng, Y Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic recognition of disordered and elderly speech remains a highly challenging task to
date due to the difficulty in collecting such data in large quantities. This paper explores a …

Self-supervised asr models and features for dysarthric and elderly speech recognition

S Hu, X **e, M Geng, Z **, J Deng, G Li… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org
Self-supervised learning (SSL) based speech foundation models have been applied to a
wide range of ASR tasks. However, their application to dysarthric and elderly speech via …

Speaker adaptation using spectro-temporal deep features for dysarthric and elderly speech recognition

M Geng, X **e, Z Ye, T Wang, G Li, S Hu… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech in recent decades, accurate recognition of dysarthric and elderly speech …

A deep learning approach to dysarthric utterance classification with BiLSTM-GRU, speech cue filtering, and log mel spectrograms

S Mehra, V Ranga, R Agarwal - The Journal of Supercomputing, 2024 - Springer
Assessing the intelligibility of dysarthric speech, characterized by intricate speaking rhythms
presents formidable challenges. Current techniques for objectively testing speech …

Personalized adversarial data augmentation for dysarthric and elderly speech recognition

Z **, M Geng, J Deng, T Wang, S Hu… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Despite the rapid progress of automatic speech recognition (ASR) technologies targeting
normal speech, accurate recognition of dysarthric and elderly speech remains a highly …

Data augmentation techniques for transfer learning-based continuous dysarthric speech recognition

TA Mariya Celin, P Vijayalakshmi… - Circuits, Systems, and …, 2023 - Springer
Data augmentation is an essential component in building a dysarthric speech recognition
system, as speech data collection from dysarthric speakers with varying degree of disorder …

Use of speech impairment severity for dysarthric speech recognition

M Geng, Z **, T Wang, S Hu, J Deng, M Cui… - arxiv preprint arxiv …, 2023 - arxiv.org
A key challenge in dysarthric speech recognition is the speaker-level diversity attributed to
both speaker-identity associated factors such as gender, and speech impairment severity …

Extending parrotron: An end-to-end, speech conversion and speech recognition model for atypical speech

R Doshi, Y Chen, L Jiang, X Zhang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
We present an extended Parrotron model: a single, end-to-end network that enables voice
conversion and recognition simultaneously. Input spectrograms are transformed to output …