Challenges of language technologies for the indigenous languages of the Americas

M Mager, X Gutierrez-Vasques, G Sierra… - ar** onsets to enhance syllabification
S Suyanto - International Journal of Speech Technology, 2019 - Springer
Two-year-old children who start learning to speak generally spell a polysyllabic word by
flip** onsets of consecutive syllables. Sometimes they speak unclearly, hard to …

Phonological similarity-based backoff smoothing to boost a bigram syllable boundary detection

S Suyanto - International Journal of Speech Technology, 2020 - Springer
Swap** one or more consonant-graphemes in a word into other phonologically similar
ones, which based on both place and manner of articulation, interestingly produces some …

Schaman: Spell-checking resources and benchmark for endangered languages from amazonia

A Oncevay, G Cardoso, C Alva, CL Ávila… - Proceedings of the …, 2022 - aclanthology.org
Spell-checkers are core applications in language learning and normalisation, which may
enormously contribute to language revitalisation and language teaching in the context of …

Educational tools for mapuzugun

C Ahumada, C Gutierrez, A Anastasopoulos - arxiv preprint arxiv …, 2022 - arxiv.org
Mapuzugun is the language of the Mapuche people. Due to political and historical reasons,
its number of speakers has decreased and the language has been excluded from the …

A morphological analyzer for Shipibo-konibo

R Cardenas, D Zeman - Proceedings of the Fifteenth Workshop …, 2018 - aclanthology.org
We present a fairly complete morphological analyzer for Shipibo-Konibo, a low-resourced
native language spoken in the Amazonian region of Peru. We resort to the robustness of …

[PDF][PDF] A continuous improvement framework of machine translation for Shipibo-konibo

HEG Montoya, KDR Rojas… - Proceedings of the 2nd …, 2019 - aclanthology.org
Shipibo-Konibo is a low-resource language from Peru with prior results in statistical machine
translation; however, it is challenging to enhance them mainly due to the expensiveness of …

Revisiting syllables in language modelling and their application on low-resource machine translation

A Oncevay, KDR Rojas, LKC Sanchez… - arxiv preprint arxiv …, 2022 - arxiv.org
Language modelling and machine translation tasks mostly use subword or character inputs,
but syllables are seldom used. Syllables provide shorter sequences than characters, require …

[PDF][PDF] Chanot: An intelligent annotation tool for indigenous and highly agglutinative languages in Peru

R Mercado-Gonzales, J Pereira-Noriega… - Proceedings of the …, 2018 - aclanthology.org
Linguistic corpus annotation is one of the most important phases for addressing Natural
Language Processing (NLP) tasks, as these methods are deeply involved with corpus …