Data-driven dependency parsing of Vedic Sanskrit

O Hellwig, S Nehrdich, S Sellmer - Language Resources and Evaluation, 2023 - Springer
This paper describes the first data-driven parser for Vedic Sanskrit, an ancient Indo-Aryan
language in which a corpus of important religious and philosophical texts has been …

The GLAUx corpus: methodological issues in designing a long-term, diverse, multi-layered corpus of Ancient Greek

A Keersmaekers - Proceedings of the 2nd International Workshop …, 2021 - aclanthology.org
This paper describes the GLAUx project (“the Greek Language Automated”), an ongoing
effort to develop a large long-term diachronic corpus of Greek, covering sixteen centuries of …

Creating, enriching and valorizing treebanks of Ancient Greek

A Keersmaekers, W Mercelis… - Proceedings of the …, 2019 - lirias.kuleuven.be
This paper shows the extent to which treebanks of Ancient Greek play a central role in the
ongoing Pedalion project at the University of Leuven. Building on diverse treebanks readily …

PapyGreek treebanks: A dataset of linguistically annotated Greek documentary papyri

M Vierros, E Henriksson - Journal of open …, 2021 - openhumanitiesdata.metajnl.com
Abstract The PapyGreek Treebanks dataset contains documentary texts written in
Postclassical Greek (ca. 300 BCE–700 CE), morphosyntactically annotated according to …

Creating a richly annotated corpus of papyrological Greek: The possibilities of natural language processing approaches to a highly inflected historical language

A Keersmaekers - Digital Scholarship in the Humanities, 2020 - academic.oup.com
This article describes a first attempt to annotate the full Greek papyrus corpus automatically
for linguistic information. It gives an overview of existing work on Ancient Greek and …

A computational approach to the Greek papyri: Developing a corpus to study variation and change in the post-classical Greek complementation system

A Keersmaekers - 2020 - lirias.kuleuven.be
The aim of this PhD project is to advance the corpus-linguistic study of the Greek papyri, a
large diachronic corpus (3rd century BC-8th century AD) of non-literary Greek. It consists of …

Creating a large-scale diachronic corpus resource: Automated parsing in the Greek papyri (and beyond)

A Keersmaekers, T Van Hal - Natural Language Engineering, 2024 - cambridge.org
This paper explores how to syntactically parse Ancient Greek texts automatically and maps
ways of fruitfully employing the results of such an automated analysis. Special attention is …

The Ancient Greek Dependency Treebank: Linguistic Annotation in a Teaching Environment

F Mambrini - Digital Classics Outside the Echo-Chamber, 2016 - ubiquitypress.com
This chapter argues that manual linguistic annotation of Ancient Greek texts can be
effectively employed to teach Greek literature and languages. Under the supervision of a …

Preprocessing Greek Papyri for linguistic annotation

M Vierros, E Henriksson - Journal of Data Mining & Digital …, 2017 - jdmdh.episciences.org
Greek documentary papyri form an important direct source for Ancient Greek. It has been
exploited surprisingly little in Greek linguistics due to a lack of good tools for searching …

Non-projectivity in the Ancient Greek dependency treebank

F Mambrini, M Passarotti - Proceedings of the second …, 2013 - aclanthology.org
In this paper, we provide a quantitative analysis of non-projective constructions attested in
the Ancient Greek Dependency Treebank (AGDT). We consider the different types of formal …