[LIBRO][B] Inductive dependency parsing

J Nivre - 2006 - Springer
Machine learning based on various forms of inductive inference has been used for a wide
range of problems in natural language processing, with syntactic parsing being one of the …

Evaluating superhuman models with consistency checks

L Fluri, D Paleka, F Tramèr - 2024 IEEE Conference on Secure …, 2024 - ieeexplore.ieee.org
If machine learning models were to achieve superhuman abilities at various reasoning or
decision-making tasks, how would we go about evaluating such models, given that humans …

Annotation error detection: Analyzing the past and present for a more coherent future

JC Klie, B Webber, I Gurevych - Computational Linguistics, 2023 - direct.mit.edu
Annotated data is an essential ingredient in natural language processing for training and
evaluating machine learning models. It is therefore very desirable for the annotations to be …

Analyzing Dataset Annotation Quality Management in the Wild

JC Klie, RE de Castilho, I Gurevych - Computational Linguistics, 2024 - direct.mit.edu
Data quality is crucial for training accurate, unbiased, and trustworthy machine learning
models as well as for their correct evaluation. Recent works, however, have shown that even …

The future of corpora in SLA

N Tracy-Ventura, M Paquot, F Myles - The Routledge handbook of …, 2020 - taylorfrancis.com
Learner corpora have generally focused on intermediate and advanced learners/users.
Corpora of beginners are rare but just as important. Second Language Acquisition (SLA) …

[PDF][PDF] Data conversion and consistency of monolingual corpora: Russian UD treebanks

K Droganova, O Lyashevskaya… - Proceedings of the 17th …, 2018 - ufal.mff.cuni.cz
The co-existence of several treebanks for one language, made by different teams and
converted from different sources, within the Universal Dependencies (UD) platform (Nivre et …

Out-of-the-box robust parsing of portuguese

J Silva, A Branco, S Castro, R Reis - … 2010, Porto Alegre, RS, Brazil, April …, 2010 - Springer
In this paper we assess to what extent the available Portuguese treebanks and available
probabilistic parsers are suitable for out-of-the-box robust parsing of Portuguese. We also …

On detecting errors in dependency treebanks

A Boyd, M Dickinson, WD Meurers - Research on Language and …, 2008 - Springer
Dependency relations between words are increasingly recognized as an important level of
linguistic representation that is close to the data and at the same time to the semantic functor …

[PDF][PDF] Conversion et améliorations de corpus du français annotés en Universal Dependencies [Conversion and Improvement of Universal Dependencies French …

B Guillaume, MC de Marneffe… - … automatique des langues, 2019 - aclanthology.org
Cet article décrit l'effort d'amélioration de deux corpus du français annotés en dépendances
syntaxiques, qui s' inscrit dans le cadre du projet Universal Dependencies (UD) qui vise à …

Inconsistency detection in semantic annotation

N Hollenstein, N Schneider… - Proceedings of the Tenth …, 2016 - aclanthology.org
Inconsistencies are part of any manually annotated corpus. Automatically finding these
inconsistencies and correcting them (even manually) can increase the quality of the data …