[PDF][PDF] On achieving and evaluating language-independence in NLP

EM Bender - Linguistic Issues in Language Technology, 2011 - journals.colorado.edu
On Achieving and Evaluating Language-Independence in NLP Page 1 Linguistic Issues in
Language Technology LiLT Submitted, October 2011 On Achieving and Evaluating …

Resources for Turkish natural language processing: A critical survey

Ç Çöltekin, AS Doğruöz, Ö Çetinoğlu - Language resources and …, 2023 - Springer
This paper presents a comprehensive survey of corpora and lexical resources available for
Turkish. We review a broad range of resources, focusing on the ones that are publicly …

Linguistic typology in natural language processing

EM Bender - Linguistic Typology, 2016 - degruyter.com
This paper explores the ways in which the field of natural language processing (NLP) can
and does benefit from work in linguistic typology. I describe the recent increase in interest in …

[PDF][PDF] Comparing language similarity across genetic and typologically-based grou**s

R Georgi, F **a, W Lewis - … of the 23rd international conference on …, 2010 - aclanthology.org
Recent studies have shown the potential benefits of leveraging resources for resource-rich
languages to build tools for similar, but resource-poor languages. We examine what …

[PDF][PDF] Automatically identifying computationally relevant typological features

W Lewis, F **a - Proceedings of the Third International Joint …, 2008 - aclanthology.org
In this paper we explore the potential for identifying computationally relevant typological
features from a multilingual corpus of language data built from readily available language …

[PDF][PDF] Towards creating precision grammars from interlinear glossed text: Inferring large-scale typological properties

EM Bender, MW Goodman, J Crowgey… - Proceedings of the 7th …, 2013 - aclanthology.org
We propose to bring together two kinds of linguistic resources—interlinear glossed text (IGT)
and a language-independent precision grammar resource—to automatically create …

TypeCraft collaborative databasing and resource sharing for linguists

D Beermann, P Mihaylov - Language resources and evaluation, 2014 - Springer
Abstract Interlinear Glossed Text (IGT) is a well established data format within philology and
the structural and generative fields of linguistics. The best known format for an IGT is the one …

The GOLD Community of Practice: An infrastructure for linguistic data on the Web

S Farrar, WD Lewis - Language Resources and Evaluation, 2007 - Springer
Abstract The GOLD Community of Practice is proposed as a model for linking on-line
linguistic data to an ontology. The key components of the model include the linguistic data …

[PDF][PDF] Multilingual structural projection across interlinear text

F **a, W Lewis - … Technologies 2007: The Conference of the North …, 2007 - aclanthology.org
This paper explores the potential for annotating and enriching data for low-density
languages via the alignment and projection of syntactic structure from parsed data for …

[PDF][PDF] Language ID in the context of harvesting language data off the web

F **a, W Lewis, H Poon - Proceedings of the 12th Conference of …, 2009 - aclanthology.org
As the arm of NLP technologies extends beyond a small core of languages, techniques for
working with instances of language data across hundreds to thousands of languages may …