Anchored correlation explanation: Topic modeling with minimal domain knowledge

RJ Gallagher, K Reing, D Kale… - Transactions of the …, 2017 - direct.mit.edu
While generative models such as Latent Dirichlet Allocation (LDA) have proven fruitful in
topic modeling, they often require detailed assumptions and careful specification of …

Low-resource cross-lingual event type detection via distant supervision with minimal effort

AO Muis, N Otani, N Vyas, R Xu, Y Yang… - Proceedings of the …, 2018 - aclanthology.org
The use of machine learning for NLP generally requires resources for training. Tasks
performed in a low-resource language usually rely on labeled data in another, typically …

Automatic speech recognition and topic identification for almost-zero-resource languages

M Wiesner, C Liu, L Ondel, C Harman… - arXiv preprint arXiv …, 2018 - arxiv.org
Automatic speech recognition (ASR) systems often need to be developed for extremely
low-resource languages to serve end-uses such as audio content categorization and search …

[BOOK] Issues in Uyghur backness harmony: Corpus, experimental, and computational studies

C Mayer - 2021 - search.proquest.com
This dissertation investigates backness harmony in Uyghur (Turkic: China) from a variety of
methodological and analytical perspectives. Backness harmony is a phenomenon where …

[BOOK] Cross-lingual and low-resource sentiment analysis

N Farra - 2019 - search.proquest.com
Identifying sentiment in a low-resource language is essential for understanding opinions
internationally and for responding to the urgent needs of locals affected by disaster incidents …

The ARIEL-CMU systems for LoReHLT18

A Chaudhary, S Dalmia, J Hu, X Li, A Matthews… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper describes the ARIEL-CMU submissions to the Low Resource Human Language
Technologies (LoReHLT) 2018 evaluations for the tasks Machine Translation (MT), Entity …

[PDF] A large-scale corpus study of phonological opacity in Uyghur

C Mayer - Ms., University of California, Irvine, 2023 - sites.socsci.uci.edu
This paper examines a case of phonological opacity in Uyghur that results from an
interaction between backness harmony and a vowel reduction process that converts …

Humanitarian Corpora for English, French and Spanish

L Isaacs, S Chambó, P León-Araúz - Proceedings of the 2024 …, 2024 - aclanthology.org
This paper presents three corpora of English, French and Spanish humanitarian documents
compiled with reports obtained from ReliefWeb through its API. ReliefWeb is a leading …

[PDF] Using prosody to find mentions of urgent problems in radio broadcasts

NG Ward, JA Jodoin, A Nath, O Fuentes - Speech Prosody, 2020 - isca-archive.org
This paper examines whether prosodic information is usefully indicative of urgency and
related attributes of situations in news broadcasts. We find that, in all 8 languages studied …

[PDF] FACULTAD DE CIENCIAS NATURALES Y EXACTAS

AB García - researchgate.net
Today, the percentage of information available in English on the World Wide Web is
decreasing, because other languages such as Chinese, Spanish, Arabic, and Portuguese are …