Computational sociophonetics using automatic speech recognition
Recent years have seen numerous advances in natural language processing that can help
accelerate sociophonetic work. These include software to align speech recordings with their …
accelerate sociophonetic work. These include software to align speech recordings with their …
Automatic speech recognition for supporting endangered language documentation
E Prud'hommeaux, R Jimerson, R Hatcher… - 2021 - scholarspace.manoa.hawaii.edu
Generating accurate word-level transcripts of recorded speech for language documentation
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …
is difficult and time-consuming, even for skilled speakers of the target language. Automatic …
" It's how you do things that matters": Attending to Process to Better Serve Indigenous Communities with Language Technologies
Indigenous languages are historically under-served by Natural Language Processing (NLP)
technologies, but this is changing for some languages with the recent scaling of large …
technologies, but this is changing for some languages with the recent scaling of large …
Sparse transcription
The transcription bottleneck is often cited as a major obstacle for efforts to document the
world's endangered languages and supply them with language technologies. One solution …
world's endangered languages and supply them with language technologies. One solution …
Writing system and speaker metadata for 2,800+ language varieties
We describe an open-source dataset providing metadata for about 2,800 language varieties
used in the world today. Specifically, the dataset provides the attested writing system (s) for …
used in the world today. Specifically, the dataset provides the attested writing system (s) for …
Recent advances in technologies for resource creation and mobilization in language documentation
Language documentation as a subfield of linguistics has arisen over the past roughly two
and a half decades more or less simultaneously with the widespread availability of …
and a half decades more or less simultaneously with the widespread availability of …
Development of automatic speech recognition for the documentation of Cook Islands Māori
R Coto-Solano, SA Nicholas, S Datta, V Quint, P Wills… - 2022 - mro.massey.ac.nz
This paper describes the process of data processing and training of an automatic speech
recognition (ASR) system for Cook Islands Māori (CIM), an Indigenous language spoken by …
recognition (ASR) system for Cook Islands Māori (CIM), an Indigenous language spoken by …
[PDF][PDF] Balancing Social Impact, Opportunities, and Ethical Constraints of Using AI in the Documentation and Vitalization of Indigenous Languages.
In this paper we discuss how AI can contribute to support the documentation and vitalization
of Indigenous languages and how that involves a delicate balancing of ensuring social …
of Indigenous languages and how that involves a delicate balancing of ensuring social …
Learning from failure: Data capture in an australian aboriginal community
Most low resource language technology development is premised on the need to collect
data for training statistical models. When we follow the typical process of recording and …
data for training statistical models. When we follow the typical process of recording and …
Enabling interactive transcription in an indigenous community
We propose a novel transcription workflow which combines spoken term detection and
human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero …
human-in-the-loop, together with a pilot experiment. This work is grounded in an almost zero …