Racial disparities in automated speech recognition

A Koenecke, A Nam, E Lake, J Nudell… - Proceedings of the …, 2020 - pnas.org
Automated speech recognition (ASR) systems, which use sophisticated machine-learning
algorithms to convert spoken language to text, have become increasingly widespread …

[HTML] Towards inclusive automatic speech recognition

S Feng, BM Halpern, O Kudina… - Computer Speech & …, 2024 - Elsevier
Practice and recent evidence show that state-of-the-art (SotA) automatic speech recognition
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …

Quantifying bias in automatic speech recognition

S Feng, O Kudina, BM Halpern… - arXiv preprint arXiv …, 2021 - arxiv.org
Automatic speech recognition (ASR) systems promise to deliver objective interpretation of
human speech. Practice and recent evidence suggests that the state-of-the-art (SotA) ASRs …

[HTML] Evaluating OpenAI's Whisper ASR: Performance analysis across diverse accents and speaker traits

C Graham, N Roll - JASA Express Letters, 2024 - pubs.aip.org
This study investigates Whisper's automatic speech recognition (ASR) system performance
across diverse native and non-native English accents. Results reveal superior recognition in …

[HTML] Combining automatic speech recognition with semantic natural language processing in schizophrenia

S Ciampelli, AE Voppel, JN De Boer, S Koops… - Psychiatry …, 2023 - Elsevier
Natural language processing (NLP) tools are increasingly used to quantify semantic
anomalies in schizophrenia. Automatic speech recognition (ASR) technology, if robust …

Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates

S Goldwater, D Jurafsky, CD Manning - Speech Communication, 2010 - Elsevier
Despite years of speech recognition research, little is known about which words tend to be
misrecognized and why. Previous work has shown that errors increase for infrequent words …

[HTML] The synergy between a humanoid robot and Whisper: bridging a gap in education

A Pande, D Mishra - Electronics, 2023 - mdpi.com
Students may encounter problems concentrating during a lecture due to various reasons,
which can be related to the educator's accent or the student's auditory difficulties. This may …

[PDF] Training and typological bias in ASR performance for world Englishes

MPY Chan, J Choe, A Li, Y Chen, X Gao… - …, 2022 - isca-archive.org
The use of automatic speech recognition (ASR) has been increasing to promote inclusion
and accessibility. Nonetheless, prior work on ASR finds performance gaps conditioned by …

Gender representation in French broadcast corpora and its impact on ASR performance

M Garnerin, S Rossato, L Besacier - … of the 1st international workshop on …, 2019 - dl.acm.org
This paper analyzes the gender representation in four major corpora of French broadcast.
These corpora being widely used within the speech processing community, they are a …

The Airbus Air Traffic Control speech recognition 2018 challenge: Towards ATC automatic transcription and call sign detection

T Pellegrini, J Farinas, E Delpech… - arXiv preprint arXiv …, 2018 - arxiv.org
In this paper, we describe the outcomes of the challenge organized and run by Airbus and
partners in 2018. The challenge consisted of two tasks applied to Air Traffic Control (ATC) …