Automatic speech recognition (asr) systems for children: A systematic literature review

V Bhardwaj, MT Ben Othman, V Kukreja, Y Belkhier… - Applied Sciences, 2022 - mdpi.com
Automatic speech recognition (ASR) is one of the ways used to transform acoustic speech
signals into text. Over the last few decades, an enormous amount of research work has been …

Sep-28k: A dataset for stuttering event detection from podcasts with people who stutter

C Lea, V Mitra, A Joshi, S Kajarekar… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
The ability to automatically detect stuttering events in speech could help speech pathologists
track an individual's fluency over time or help improve speech recognition systems for …

From user perceptions to technical improvement: Enabling people who stutter to better use speech recognition

C Lea, Z Huang, J Narain, L Tooley, D Yee… - Proceedings of the …, 2023 - dl.acm.org
Consumer speech recognition systems do not work as well for many people with speech
differences, such as stuttering, relative to the rest of the general population. However, what …

KSoF: The Kassel state of fluency dataset--a therapy centered dataset of stuttering

SP Bayerl, AW von Gudenberg, F Hönig, E Nöth… - arxiv preprint arxiv …, 2022 - arxiv.org
Stuttering is a complex speech disorder that negatively affects an individual's ability to
communicate effectively. Persons who stutter (PWS) often suffer considerably under the …

“The World is Designed for Fluent People”: Benefits and Challenges of Videoconferencing Technologies for People Who Stutter

S Wu - Proceedings of the 2023 CHI Conference on Human …, 2023 - dl.acm.org
This work studies the experiences of people who stutter (PWS) with videoconferencing (VC)
and VC technologies. Our interview study with 13 adults who stutter uncovers extra …

Analysis and tuning of a voice assistant system for dysfluent speech

V Mitra, Z Huang, C Lea, L Tooley, S Wu… - arxiv preprint arxiv …, 2021 - arxiv.org
Dysfluencies and variations in speech pronunciation can severely degrade speech
recognition performance, and for many individuals with moderate-to-severe speech …

Ssdm: Scalable speech dysfluency modeling

J Lian, X Zhou, Z Ezzes, J Vonk, B Morin… - arxiv preprint arxiv …, 2024 - arxiv.org
Speech dysfluency modeling is the core module for spoken language learning, and speech
therapy. However, there are three challenges. First, current state-of-the-art solutions\cite …

Unconstrained dysfluency modeling for dysfluent speech transcription and detection

J Lian, C Feng, N Farooqi, S Li… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Dysfluent speech modeling requires time-accurate and silence-aware transcription at both
the word-level and phonetic-level. However, current research in dysfluency modeling …

Sequence labeling to detect stuttering events in read speech

S Alharbi, M Hasan, AJH Simons, S Brumfitt… - Computer Speech & …, 2020 - Elsevier
Stuttering is a speech disorder that, if treated during childhood, may be prevented from
persisting into adolescence. A clinician must first determine the severity of stuttering …

[PDF][PDF] Frame-Level Stutter Detection.

JB Harvill, M Hasegawa-Johnson, CD Yoo - INTERSPEECH, 2022 - isca-archive.org
Previous studies on the detection of stuttered speech have focused on classification at the
utterance level (eg, for speech therapy applications), and on the correct insertion of stutter …