Automatic speech recognition for under-resourced languages: A survey

L Besacier, E Barnard, A Karpov, T Schultz - Speech communication, 2014 - Elsevier
Speech processing for under-resourced languages is an active field of research, which has
experienced significant progress during the past decade. We propose, in this paper, a …

An attentional model for speech translation without transcription

L Duong, A Anastasopoulos, D Chiang… - Proceedings of the …, 2016 - aclanthology.org
For many low-resource languages, spoken language resources are more likely to be
annotated with translations than transcriptions. This bilingual speech data can be used for …

Breaking the unwritten language barrier: The BULB project

G Adda, S Stüker, M Adda-Decker… - Procedia Computer …, 2016 - Elsevier
The project Breaking the Unwritten Language Barrier (BULB), which brings together
linguists and computer scientists, aims at supporting linguists in documenting unwritten …

Unsupervised word segmentation from speech with attention

P Godard, M Zanon-Boito, L Ondel, A Berard… - arXiv preprint arXiv …, 2018 - arxiv.org
We present a first attempt to perform attentional word segmentation directly from the speech
signal, with the final goal to automatically identify lexical units in a low-resource, unwritten …

An unsupervised probability model for speech-to-translation alignment of low-resource languages

A Anastasopoulos, D Chiang, L Duong - arXiv preprint arXiv:1609.08139, 2016 - arxiv.org
For many low-resource languages, spoken language resources are more likely to be
annotated with translations than with transcriptions. Translated speech data is potentially …

[BOOK][B] Computational tools for endangered language documentation

A Anastasopoulos - 2019 - search.proquest.com
COMPUTATIONAL TOOLS FOR ENDANGERED LANGUAGE DOCUMENTATION A
Dissertation Submitted to the Graduate School of the University of N Page 1 …

Preliminary experiments on unsupervised word discovery in Mboshi

P Godard, G Adda, M Adda-Decker, A Allauzen… - Interspeech 2016, 2016 - hal.science
The necessity to document thousands of endangered languages encourages the
collaboration between linguists and computer scientists in order to provide the documentary …

Weakly supervised word segmentation for computational language documentation

S Okabe, L Besacier, F Yvon - Annual meeting of the Association for …, 2022 - hal.science
Word and morpheme segmentation are fundamental steps of language documentation as
they allow to discover lexical units in a language for which the lexicon is unknown. However …

Automatic understanding of unwritten languages

O Adams - 2017 - minerva-access.unimelb.edu.au
Many of the world's languages are falling out of use without a written record and minimal
linguistic documentation. Language documentation is a slow process and there are an …

Bootstrapping text-to-speech for speech processing in languages without an orthography

S Sitaram, S Palkar, YN Chen… - … on Acoustics, Speech …, 2013 - ieeexplore.ieee.org
Speech synthesis technology has reached the stage where given a well-designed corpus of
audio and accurate transcription an at least understandable synthesizer can be built without …