When linguistics meets web technologies. Recent advances in modelling linguistic linked data

AF Khan, C Chiarcos, T Declerck, D Gifu… - Semantic …, 2022 - content.iospress.com
When linguistics meets web technologies. Recent advances in modelling linguistic linked data
- IOS Press You are viewing a javascript disabled version of the site. Please enable Javascript …

Automatic interlinear glossing for under-resourced languages leveraging translations

X Zhao, S Ozaki, A Anastasopoulos… - Proceedings of the …, 2020 - aclanthology.org
Abstract Interlinear Glossed Text (IGT) is a widely used format for encoding linguistic
information in language documentation projects and scholarly papers. Manual production of …

Automating gloss generation in interlinear glossed text

A McMillan-Major - Society for Computation in …, 2020 - openpublishing.library.umass.edu
Abstract Interlinear Glossed Text (IGT) is a rich data type produced by linguists for the
purposes of presenting an analysis of a language\'s semantic and grammatical properties. I …

Generalized Glossing Guidelines: An Explicit, Human-and Machine-Readable, Item-and-Process Convention for Morphological Annotation

DR Mortensen, E Gulsen, T He… - Proceedings of the …, 2023 - aclanthology.org
Interlinear glossing provides a vital type of morphosyntactic annotation, both for linguists and
language revitalists, and numerous conventions exist for representing it formally and …

IMTVault: Extracting and enriching low-resource language interlinear glossed text from grammatical descriptions and typological survey articles

S Nordhoff, T Krämer - Proceedings of the 8th Workshop on …, 2022 - aclanthology.org
Many NLP resources and programs focus on a handful of major languages. But there are
thousands of languages with low or no resources available as structured data. This paper …

From Aari to Zulu: massively multilingual creation of language tools using interlinear glossed text

RA Georgi - 2016 - digital.lib.washington.edu
This dissertation examines the suitability of Interlinear Glossed Text (IGT) as a
computational, semi-structured resource for creating NLP tools for resource-poor languages …

Enriching a massively multilingual database of interlinear glossed text

F **a, WD Lewis, MW Goodman, G Slayden… - Language Resources …, 2016 - Springer
The majority of the world's languages have little to no NLP resources or tools. This is due to
a lack of training data (“resources”) over which tools, such as taggers or parsers, can be …

Modelling and annotating interlinear glossed text from 280 different endangered languages as linked data with LIGT

S Nordhoff - Proceedings of the 14th Linguistic Annotation …, 2020 - aclanthology.org
This paper reports on the harvesting, analysis, and enrichment of 20k documents from 4
different endangered language archives in 300 different low-resource languages. The …

[BUCH][B] Assembling syntax: Modeling constituent questions in a grammar engineering framework

O Zamaraeva - 2021 - search.proquest.com
This dissertation is dedicated to a cross-linguistic account of constituent (aka wh-) questions
as part of a grammar engineering toolkit, the Grammar Matrix, couched in the Head-driven …

[PDF][PDF] Enriching ODIN.

F **a, WD Lewis, MW Goodman, J Crowgey… - LREC, 2014 - lrec-conf.org
In this paper, we describe the expansion of the ODIN resource, a database containing many
thousands of instances of Interlinear Glossed Text (IGT) for over a thousand languages. A …