Decolonising speech and language technology

S Bird - 28th International Conference on Computational …, 2020 - researchers.cdu.edu.au
After generations of exploitation, Indigenous people often respond negatively to the idea that
their languages are data ready for the taking. By treating Indigenous knowledge as a …

Comprehensive literature review on children automatic speech recognition system, acoustic linguistic mismatch approaches and challenges

R Sobti, K Guleria, V Kadyan - Multimedia Tools and Applications, 2024 - Springer
Automatic Speech Recognition (ASR) system for children is as important as for
adults since children are more dependent on these systems nowadays, such as computer …

Leveraging orthographic information to improve machine translation of under-resourced languages

A Chakravarthi, B Raja - 2020 - researchrepository …
This thesis presents Bharathi Raja Asoka Chakravarthi's work on machine translation for
under-resourced languages. As the internet becomes increasingly available to the whole …

Not always about you: Prioritizing community needs when developing endangered language technology

Z Liu, C Richardson, R Hatcher Jr… - arXiv preprint arXiv …, 2022 - arxiv.org
Languages are classified as low-resource when they lack the quantity of data necessary for
training statistical and machine learning tools and models. Causes of resource scarcity vary …

Indigenous language technologies in Canada: Assessment, challenges, and successes

P Littell, A Kazantseva, R Kuhn, A Pine… - Proceedings of the …, 2018 - aclanthology.org
In this article, we discuss which text, speech, and image technologies have been developed,
and would be feasible to develop, for the approximately 60 Indigenous languages spoken in …

Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec

J Shi, JD Amith, RC García, EG Sierra, K Duh… - arXiv preprint arXiv …, 2021 - arxiv.org
"Transcription bottlenecks", created by a shortage of effective human transcribers, are one of
the main challenges to endangered language (EL) documentation. Automatic speech …

A survey of orthographic information in machine translation

BR Chakravarthi, P Rani, M Arcan, JP McCrae - SN Computer Science, 2021 - Springer
Machine translation is one of the applications of natural language processing which
has been explored in different languages. Recently researchers started paying attention …

Automatic interlinear glossing for under-resourced languages leveraging translations

X Zhao, S Ozaki, A Anastasopoulos… - Proceedings of the …, 2020 - aclanthology.org
Interlinear Glossed Text (IGT) is a widely used format for encoding linguistic
information in language documentation projects and scholarly papers. Manual production of …

Development of automatic speech recognition for the documentation of Cook Islands Māori

R Coto-Solano, SA Nicholas, S Datta, V Quint, P Wills… - 2022 - mro.massey.ac.nz
This paper describes the process of data processing and training of an automatic speech
recognition (ASR) system for Cook Islands Māori (CIM), an Indigenous language spoken by …

Indigenous language technologies: Anti-colonial oases in a colonizing (digital) world

NT Brinklow - WINHEC: International Journal of Indigenous …, 2021 - journals.uvic.ca
Language technologies are an increasingly common part of daily life for people
around the world. Millions of users per day access services like Google Translate and …