[HTML][HTML] Unsupervised automatic speech recognition: A review
Abstract Automatic Speech Recognition (ASR) systems can be trained to achieve
remarkable performance given large amounts of manually transcribed speech, but large …
remarkable performance given large amounts of manually transcribed speech, but large …
Unsupervised cross-lingual transfer of word embedding spaces
Context-aware cross-lingual map**
Cross-lingual word vectors are typically obtained by fitting an orthogonal matrix that maps
the entries of a bilingual dictionary from a source to a target vector space. Word vectors …
the entries of a bilingual dictionary from a source to a target vector space. Word vectors …
Understanding models understanding language
A Søgaard - Synthese, 2022 - Springer
Abstract Landgrebe and Smith (Synthese 198 (March): 2061–2081, 2021) present an
unflattering diagnosis of recent advances in what they call language-centric artificial …
unflattering diagnosis of recent advances in what they call language-centric artificial …
Using optimal transport as alignment objective for fine-tuning multilingual contextualized embeddings
Recent studies have proposed different methods to improve multilingual word
representations in contextualized settings including techniques that align between source …
representations in contextualized settings including techniques that align between source …
NORMA: Neighborhood sensitive maps for multilingual word embeddings
N Nakashole - Proceedings of the 2018 Conference on Empirical …, 2018 - aclanthology.org
Inducing multilingual word embeddings by learning a linear map between embedding
spaces of different languages achieves remarkable accuracy on related languages …
spaces of different languages achieves remarkable accuracy on related languages …
Addressing noise in multidialectal word embeddings
Word embeddings are crucial to many natural language processing tasks. The quality of
embeddings relies on large non-noisy corpora. Arabic dialects lack large corpora and are …
embeddings relies on large non-noisy corpora. Arabic dialects lack large corpora and are …
Analogy training multilingual encoders
Abstract Language encoders encode words and phrases in ways that capture their local
semantic relatedness, but are known to be globally inconsistent. Global inconsistency can …
semantic relatedness, but are known to be globally inconsistent. Global inconsistency can …
LessLex: Linking multilingual embeddings to SenSe representations of LE**cal items
We present LESSLEX, a novel multilingual lexical resource. Different from the vast majority
of existing approaches, we ground our embeddings on a sense inventory made available …
of existing approaches, we ground our embeddings on a sense inventory made available …
Bilingual lexicon induction for low-resource languages using graph matching via optimal transport
Bilingual lexicons form a critical component of various natural language processing
applications, including unsupervised and semisupervised machine translation and …
applications, including unsupervised and semisupervised machine translation and …