Multiword expression processing: A survey

M Constant, G Eryiğit, J Monti, L Van Der Plas… - Computational …, 2017 - direct.mit.edu
Multiword expressions (MWEs) are a class of linguistic forms spanning conventional word
boundaries that are both idiosyncratic and pervasive across different languages. The …

Improved transition-based parsing by modeling characters instead of words with LSTMs

M Ballesteros, C Dyer, NA Smith - arxiv preprint arxiv:1508.00657, 2015 - arxiv.org
We present extensions to a continuous-state dependency parsing method that makes it
applicable to morphologically rich languages. Starting with a high-performance transition …

Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP

R Van Der Goot, A Üstün, A Ramponi, I Sharaf… - arxiv preprint arxiv …, 2020 - arxiv.org
Transfer learning, particularly approaches that combine multi-task learning with pre-trained
contextualized embeddings and fine-tuning, have advanced the field of Natural Language …

Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages

D Seddah, R Tsarfaty, S Kübler, M Candito… - Proceedings of the …, 2013 - hal.science
This paper reports on the first shared task on statistical parsing of morphologically rich lan-
guages (MRLs). The task features data sets from nine languages, each available both in …

[PDF][PDF] magyarlanc: A tool for morphological and dependency parsing of hungarian

J Zsibrita, V Vincze, R Farkas - Proceedings of the International …, 2013 - aclanthology.org
Hungarian is the stereotype of morphologically rich and free word order languages. Here,
we introduce magyarlanc, a natural language toolkit developed for the linguistic …

Building the essential resources for Finnish: the Turku Dependency Treebank

K Haverinen, J Nyblom, T Viljanen, V Laippala… - Language Resources …, 2014 - Springer
In this paper, we present the final version of a publicly available treebank of Finnish, the
Turku Dependency Treebank. The treebank contains 204,399 tokens (15,126 sentences) …

[PDF][PDF] Introducing the SPMRL 2014 shared task on parsing morphologically-rich languages

D Seddah, S Kübler, R Tsarfaty - … of the First Joint Workshop on …, 2014 - aclanthology.org
This first joint meeting on Statistical Parsing of Morphologically Rich Languages and
Syntactic Analysis of Non-Canonical English (SPMRL-SANCL) featured a shared task on …

What's in an embedding? Analyzing word embeddings through multilingual evaluation

A Köhn - 2015 - edoc.sub.uni-hamburg.de
In the last two years, there has been a surge of word embedding algorithms and research on
them. However, evaluation has mostly been carried out on a narrow set of tasks, mainly …

Morphynet: a large multilingual database of derivational and inflectional morphology

K Batsuren, G Bella, F Giunchiglia - Proceedings of the 18th …, 2021 - aclanthology.org
Large-scale morphological databases provide essential input to a wide range of NLP
applications. Inflectional data is of particular importance for morphologically rich …

Creation of an annotated corpus of Old and Middle Hungarian court records and private correspondence

A Novák, K Gugán, M Varga, A Dömötör - Language Resources and …, 2018 - Springer
The paper introduces a novel annotated corpus of Old and Middle Hungarian (16–18
century), the texts of which were selected in order to approximate the vernacular of the given …