Computational historical linguistics
G Jäger - Theoretical Linguistics, 2019 - degruyter.com
Computational approaches to historical linguistics have been proposed for half a century.
Within the last decade, this line of research has received a major boost, owing both to the …
Within the last decade, this line of research has received a major boost, owing both to the …
[PDF][PDF] Clustering semantically equivalent words into cognate sets in multilingual lists
Word lists have become available for most of the world's languages, but only a small fraction
of such lists contain cognate information. We present a machine-learning approach that …
of such lists contain cognate information. We present a machine-learning approach that …
Neural decipherment via minimum-cost flow: From Ugaritic to Linear B
In this paper we propose a novel neural approach for automatic decipherment of lost
languages. To compensate for the lack of strong supervision signal, our model design is …
languages. To compensate for the lack of strong supervision signal, our model design is …
Modeling word forms using latent underlying morphs and phonology
The observed pronunciations or spellings of words are often explained as arising from the
“underlying forms” of their morphemes. These forms are latent strings that linguists try to …
“underlying forms” of their morphemes. These forms are latent strings that linguists try to …
[PDF][PDF] Automatic detection of cognates using orthographic alignment
Abstract Words undergo various changes when entering new languages. Based on the
assumption that these linguistic changes follow certain rules, we propose a method for …
assumption that these linguistic changes follow certain rules, we propose a method for …
Identifying cognate sets across dictionaries of related languages
A St Arnaud - 2017 - era.library.ualberta.ca
Cognates are words in related languages that have originated from the same word in an
ancestor language, such as the English/German word pair father/Vater. Cognate information …
ancestor language, such as the English/German word pair father/Vater. Cognate information …
Can Cognate Prediction Be Modelled as a Low-Resource Machine Translation Task?
Cognate prediction is the task of generating, in a given language, the likely cognates of
words in a related language, where cognates are words in related languages that have …
words in a related language, where cognates are words in related languages that have …
Comparing fifty natural languages and twelve genetic languages using word embedding language divergence (WELD) as a quantitative measure of language …
We introduce a new measure of distance between languages based on word embedding,
called word embedding language divergence (WELD). WELD is defined as divergence …
called word embedding language divergence (WELD). WELD is defined as divergence …
Neural Approaches to Historical Words Reconstruction
C Fourrier - 2022 - theses.hal.science
In historical linguistics, cognates are words that descend in direct line from a common
ancestor, called their proto-form, and therefore are representative of their respective …
ancestor, called their proto-form, and therefore are representative of their respective …