[HTML][HTML] Unsupervised automatic speech recognition: A review

H Aldarmaki, A Ullah, S Ram, N Zaki - Speech Communication, 2022 - Elsevier
Abstract Automatic Speech Recognition (ASR) systems can be trained to achieve
remarkable performance given large amounts of manually transcribed speech, but large …

Unsupervised cross-lingual transfer of word embedding spaces

R Xu, Y Yang, N Otani, Y Wu - ar**s among
words in different languages by learning the transformation functions over the corresponding …

Context-aware cross-lingual map**

H Aldarmaki, M Diab - arxiv preprint arxiv:1903.03243, 2019 - arxiv.org
Cross-lingual word vectors are typically obtained by fitting an orthogonal matrix that maps
the entries of a bilingual dictionary from a source to a target vector space. Word vectors …

Understanding models understanding language

A Søgaard - Synthese, 2022 - Springer
Abstract Landgrebe and Smith (Synthese 198 (March): 2061–2081, 2021) present an
unflattering diagnosis of recent advances in what they call language-centric artificial …

Using optimal transport as alignment objective for fine-tuning multilingual contextualized embeddings

S Alqahtani, G Lalwani, Y Zhang, S Romeo… - arxiv preprint arxiv …, 2021 - arxiv.org
Recent studies have proposed different methods to improve multilingual word
representations in contextualized settings including techniques that align between source …

NORMA: Neighborhood sensitive maps for multilingual word embeddings

N Nakashole - Proceedings of the 2018 Conference on Empirical …, 2018 - aclanthology.org
Inducing multilingual word embeddings by learning a linear map between embedding
spaces of different languages achieves remarkable accuracy on related languages …

Addressing noise in multidialectal word embeddings

A Erdmann, N Zalmout, N Habash - … of the 56th Annual Meeting of …, 2018 - aclanthology.org
Word embeddings are crucial to many natural language processing tasks. The quality of
embeddings relies on large non-noisy corpora. Arabic dialects lack large corpora and are …

Analogy training multilingual encoders

N Garneau, M Hartmann, A Sandholm… - Proceedings of the …, 2021 - ojs.aaai.org
Abstract Language encoders encode words and phrases in ways that capture their local
semantic relatedness, but are known to be globally inconsistent. Global inconsistency can …

LessLex: Linking multilingual embeddings to SenSe representations of LE**cal items

D Colla, E Mensa, DP Radicioni - Computational Linguistics, 2020 - direct.mit.edu
We present LESSLEX, a novel multilingual lexical resource. Different from the vast majority
of existing approaches, we ground our embeddings on a sense inventory made available …

Bilingual lexicon induction for low-resource languages using graph matching via optimal transport

K Marchisio, A Saad-Eldin, K Duh, C Priebe… - arxiv preprint arxiv …, 2022 - arxiv.org
Bilingual lexicons form a critical component of various natural language processing
applications, including unsupervised and semisupervised machine translation and …