Hey ASR system! Why aren't you more inclusive? Automatic speech recognition systems' bias and proposed bias mitigation techniques. A literature review

MK Ngueajio, G Washington - International conference on human …, 2022 - Springer
Speech is the fundamental means of communication between humans. The advent of AI and
sophisticated speech technologies have led to the rapid proliferation of human to computer …

Adaptation algorithms for neural network-based speech recognition: An overview

P Bell, J Fainberg, O Klejch, J Li… - IEEE Open Journal …, 2020 - ieeexplore.ieee.org
We present a structured overview of adaptation algorithms for neural network-based speech
recognition, considering both hybrid hidden Markov model/neural network systems and end …

Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

Towards inclusive automatic speech recognition

S Feng, BM Halpern, O Kudina… - Computer Speech & …, 2024 - Elsevier
Practice and recent evidence show that state-of-the-art (SotA) automatic speech recognition
(ASR) systems do not perform equally well for all speaker groups. Many factors can cause …

CrossNER: Evaluating cross-domain named entity recognition

Z Liu, Y Xu, T Yu, W Dai, Z Ji, S Cahyawijaya… - Proceedings of the …, 2021 - ojs.aaai.org
Cross-domain named entity recognition (NER) models are able to cope with the scarcity
issue of NER samples in target domains. However, most of the existing NER benchmarks …

AdaptSum: Towards low-resource domain adaptation for abstractive summarization

T Yu, Z Liu, P Fung - arXiv preprint arXiv:2103.11332, 2021 - arxiv.org
State-of-the-art abstractive summarization models generally rely on extensive labeled data,
which lowers their generalization ability on domains where such data are not available. In …

Evaluating OpenAI's Whisper ASR: Performance analysis across diverse accents and speaker traits

C Graham, N Roll - JASA Express Letters, 2024 - pubs.aip.org
This study investigates Whisper's automatic speech recognition (ASR) system performance
across diverse native and non-native English accents. Results reveal superior recognition in …

Coach: A coarse-to-fine approach for cross-domain slot filling

Z Liu, GI Winata, P Xu, P Fung - arXiv preprint arXiv:2004.11727, 2020 - arxiv.org
As an essential task in task-oriented dialog systems, slot filling requires extensive training
data in a certain domain. However, such data are not always available. Hence, cross …

Meta learning for natural language processing: A survey

H Lee, SW Li, NT Vu - arXiv preprint arXiv:2205.01500, 2022 - arxiv.org
Deep learning has been the mainstream technique in natural language processing (NLP)
area. However, the techniques require many labeled data and are less generalizable across …