How efficiency shapes human language

E Gibson, R Futrell, SP Piantadosi, I Dautriche… - Trends in cognitive …, 2019 - cell.com
Cognitive science applies diverse tools and perspectives to study human language.
Recently, an exciting body of work has examined linguistic phenomena through the lens of …

Dependency grammar

MC De Marneffe, J Nivre - Annual Review of Linguistics, 2019 - annualreviews.org
Dependency grammar is a descriptive and theoretical tradition in linguistics that can be
traced back to antiquity. It has long been influential in the European linguistics tradition and …

Quantifying social biases in NLP: A generalization and empirical comparison of extrinsic fairness metrics

P Czarnowska, Y Vyas, K Shah - Transactions of the Association for …, 2021 - direct.mit.edu
Measuring bias is key for better understanding and addressing unfairness in NLP/ML
models. This is often done via fairness metrics, which quantify the differences in a model's …

Data augmentation via dependency tree morphing for low-resource languages

GG Şahin, M Steedman - arXiv preprint arXiv:1903.09460, 2019 - arxiv.org
Neural NLP systems achieve high scores in the presence of a sizable training dataset. Lack of
such datasets leads to poor system performance in the case of low-resource languages. We …

Token-based typology and word order entropy: A study based on Universal Dependencies

N Levshina - Linguistic Typology, 2019 - degruyter.com
The present paper discusses the benefits and challenges of token-based typology, which
takes into account the frequencies of words and constructions in language use. This …

The entropy of words—Learnability and expressivity across more than 1000 languages

C Bentz, D Alikaniotis, M Cysouw, R Ferrer-i-Cancho - Entropy, 2017 - mdpi.com
The choice associated with words is a fundamental property of natural languages. It lies at
the heart of quantitative linguistics, computational linguistics and language sciences more …

Universals of word order reflect optimization of grammars for efficient communication

M Hahn, D Jurafsky, R Futrell - Proceedings of the National Academy of …, 2020 - pnas.org
The universal properties of human languages have been the subject of intense study across
the language sciences. We report computational and corpus evidence for the hypothesis …

BERT syntactic transfer: A computational experiment on Italian, French and English languages

R Guarasci, S Silvestri, G De Pietro, H Fujita… - Computer Speech & …, 2022 - Elsevier
This paper investigates the ability of the multilingual BERT (mBERT) language model to transfer
syntactic knowledge cross-lingually, verifying if and to what extent syntactic dependency …

The Natural Stories corpus: a reading-time corpus of English texts containing rare syntactic constructions

R Futrell, E Gibson, HJ Tily, I Blank… - Language Resources …, 2021 - Springer
It is now a common practice to compare models of human language processing by
comparing how well they predict behavioral and neural measures of processing difficulty …

Why we need a gradient approach to word order

N Levshina, S Namboodiripad, M Allassonnière-Tang… - Linguistics, 2023 - degruyter.com
This article argues for a gradient approach to word order, which treats word order
preferences, both within and across languages, as a continuous variable. Word order …