A standardized Project Gutenberg corpus for statistical analysis of natural language and quantitative linguistics

M Gerlach, F Font-Clos - Entropy, 2020 - mdpi.com
The use of Project Gutenberg (PG) as a text corpus has been extremely popular in statistical
analysis of language for more than 25 years. However, in contrast to other major linguistic …

Slangvolution: A causal analysis of semantic change and frequency dynamics in slang

D Keidar, A Opedal, Z **, M Sachan - arxiv preprint arxiv:2203.04651, 2022 - arxiv.org
Languages are continuously undergoing changes, and the mechanisms that underlie these
changes are still a matter of debate. In this work, we approach language evolution through …

Evolving linguistic divergence on polarizing social media

A Karjus, C Cuskley - Humanities and Social Sciences …, 2024 - nature.com
Abstract Language change is influenced by many factors, but often starts from synchronic
variation, where multiple linguistic patterns or forms coexist, or where different speech …

Individuality in complex systems: A constructionist approach

P Petré, L Anthonissen - Cognitive Linguistics, 2020 - degruyter.com
For a long time, linguists more or less denied the existence of individual differences in
grammatical knowledge. While recent years have seen an explosion of research on …

The evolution of structural genomics

DM Standley, T Nakanishi, Z Xu, S Haruna, S Li… - Biophysical …, 2022 - Springer
Structural genomics began as a global effort in the 1990s to determine the tertiary structures
of all protein families as a response to large-scale genome sequencing projects. The …

Individuality in syntactic variation: An investigation of the seventeenth-century gerund alternation

L Fonteyn, A Nini - Cognitive Linguistics, 2020 - degruyter.com
This study investigates the extent to which there is individuality in how structural variation is
conditioned over time. Earlier research already classified the diachronically unstable gerund …

The Royal Society Corpus 6.0: Providing 300+ years of scientific writing for humanistic study

S Fischer, J Knappen, K Menzel… - Proceedings of the Twelfth …, 2020 - aclanthology.org
We present a new, extended version of the Royal Society Corpus (RSC), a diachronic
corpus of scientific English now covering 300+ years of scientific writing (1665–1996). The …

Semantic journeys: quantifying change in emoji meaning from 2012-2018

A Robertson, FF Liza, D Nguyen, B McGillivray… - arxiv preprint arxiv …, 2021 - arxiv.org
The semantics of emoji has, to date, been considered from a static perspective. We offer the
first longitudinal study of how emoji semantics changes over time, applying techniques from …

[PDF][PDF] Emprunts en français contemporain: étude linguistique et statistique à partir de la plateforme Néoveille

E Cartier - L'emprunt en question (s): conceptions, réceptions …, 2019 - hal.science
Parmi l'ensemble des procédés néologiques, l'emprunt occupe une place à part puisque le
matériau provient d'un autre système linguistique. Il occupe également une place de choix …

Is language change chiefly a social diffusion affair? The role of entrenchment in frequency increase and in the emergence of complex structural patterns

Q Feltgen - Frontiers in Complex Systems, 2024 - frontiersin.org
Complex systems research has chiefly investigated language change from a social
dynamics perspective, with undeniable success. However, there is more to language …