[書籍][B] Sequence comparison in historical linguistics

M List - 2014 - books.google.com
The comparison of sound sequences (words, morphemes) constitutes the core of many
techniques and methods in historical linguistics. With the help of these techniques …

Computational historical linguistics

G Jäger - Theoretical Linguistics, 2019 - degruyter.com
Computational approaches to historical linguistics have been proposed for half a century.
Within the last decade, this line of research has received a major boost, owing both to the …

[PDF][PDF] Clustering semantically equivalent words into cognate sets in multilingual lists

B Hauer, G Kondrak - … of 5th international joint conference on …, 2011 - aclanthology.org
Word lists have become available for most of the world's languages, but only a small fraction
of such lists contain cognate information. We present a machine-learning approach that …

Neural decipherment via minimum-cost flow: From Ugaritic to Linear B

J Luo, Y Cao, R Barzilay - arxiv preprint arxiv:1906.06718, 2019 - arxiv.org
In this paper we propose a novel neural approach for automatic decipherment of lost
languages. To compensate for the lack of strong supervision signal, our model design is …

Modeling word forms using latent underlying morphs and phonology

R Cotterell, N Peng, J Eisner - Transactions of the Association for …, 2015 - direct.mit.edu
The observed pronunciations or spellings of words are often explained as arising from the
“underlying forms” of their morphemes. These forms are latent strings that linguists try to …

[PDF][PDF] Automatic detection of cognates using orthographic alignment

AM Ciobanu, LP Dinu - Proceedings of the 52nd Annual Meeting …, 2014 - aclanthology.org
Abstract Words undergo various changes when entering new languages. Based on the
assumption that these linguistic changes follow certain rules, we propose a method for …

Identifying cognate sets across dictionaries of related languages

A St Arnaud - 2017 - era.library.ualberta.ca
Cognates are words in related languages that have originated from the same word in an
ancestor language, such as the English/German word pair father/Vater. Cognate information …

Can Cognate Prediction Be Modelled as a Low-Resource Machine Translation Task?

C Fourrier, R Bawden, B Sagot - ACL-IJCNLP 2021-Findings of the …, 2021 - inria.hal.science
Cognate prediction is the task of generating, in a given language, the likely cognates of
words in a related language, where cognates are words in related languages that have …

Comparing fifty natural languages and twelve genetic languages using word embedding language divergence (WELD) as a quantitative measure of language …

E Asgari, MRK Mofrad - arxiv preprint arxiv:1604.08561, 2016 - arxiv.org
We introduce a new measure of distance between languages based on word embedding,
called word embedding language divergence (WELD). WELD is defined as divergence …

Neural Approaches to Historical Words Reconstruction

C Fourrier - 2022 - theses.hal.science
In historical linguistics, cognates are words that descend in direct line from a common
ancestor, called their proto-form, and therefore are representative of their respective …