CoNLL-X shared task on multilingual dependency parsing
S Buchholz, E Marsi - Proceedings of the tenth conference on …, 2006 - aclanthology.org
Each year the Conference on Computational Natural Language Learning (CoNLL)
features a shared task, in which participants train and test their systems on exactly the same …
HuSpaCy: an industrial-strength Hungarian natural language processing toolkit
Although there are a couple of open-source language processing pipelines available for
Hungarian, none of them satisfies the requirements of today's NLP applications. A language …
A multilingual named entity recognition system using boosting and C4.5 decision tree learning algorithms
In this paper we introduce a multilingual Named Entity Recognition (NER) system that uses
statistical modeling techniques. The system identifies and classifies NEs in the Hungarian …
The Szeged Treebank
D Csendes, J Csirik, T Gyimóthy, A Kocsor - Text, Speech and Dialogue …, 2005 - Springer
The major aim of the Szeged Treebank project was to create a high-quality database of
syntactic structures for Hungarian that can serve as a golden standard to further research in …
HunPos - an open source trigram tagger
In the world of non-proprietary NLP software the standard, and perhaps the best, HMM-
based POS tagger is TnT (Brants, 2000). We argue here that some of the criticism aimed at …
Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines
This paper presents a set of industrial-grade text processing models for Hungarian that
achieve near state-of-the-art performance while balancing resource efficiency and accuracy …
A highly accurate Named Entity corpus for Hungarian
A highly accurate Named Entity (NE) corpus for Hungarian that is publicly available for
research purposes is introduced in the paper, along with its main properties. The results of …
Morphological and syntactic case in statistical dependency parsing
W Seeker, J Kuhn - Computational Linguistics, 2013 - direct.mit.edu
Most morphologically rich languages with free word order use case systems to mark the
grammatical function of nominal elements, especially for the core argument functions of a …
Web-based frequency dictionaries for medium density languages
Frequency dictionaries play an important role both in psycholinguistic experiment design
and in language technology. The paper describes a new, freely available, web-based …
Context-aware correction of spelling errors in Hungarian medical documents
Owing to the growing need of acquiring medical data from clinical records, processing such
documents is an important topic in natural language processing (NLP). However, for general …