[PDF][PDF] CoNLL-X shared task on multilingual dependency parsing

S Buchholz, E Marsi - Proceedings of the tenth conference on …, 2006 - aclanthology.org
Each year the Conference on Computational Natural Language Learning (CoNLL) 1
features a shared task, in which participants train and test their systems on exactly the same …

HuSpaCy: an industrial-strength Hungarian natural language processing toolkit

G Orosz, Z Szántó, P Berkecz, G Szabó… - arxiv preprint arxiv …, 2022 - arxiv.org
Although there are a couple of open-source language processing pipelines available for
Hungarian, none of them satisfies the requirements of today's NLP applications. A language …

A multilingual named entity recognition system using boosting and c4. 5 decision tree learning algorithms

G Szarvas, R Farkas, A Kocsor - … , DS 2006, Barcelona, Spain, October 7 …, 2006 - Springer
In this paper we introduce a multilingual Named Entity Recognition (NER) system that uses
statistical modeling techniques. The system identifies and classifies NEs in the Hungarian …

The szeged treebank

D Csendes, J Csirik, T Gyimóthy, A Kocsor - Text, Speech and Dialogue …, 2005 - Springer
The major aim of the Szeged Treebank project was to create a high-quality database of
syntactic structures for Hungarian that can serve as a golden standard to further research in …

[PDF][PDF] HunPos-an open source trigram tagger

P Halácsy, A Kornai, C Oravecz - 2007 - eprints.sztaki.hu
In the world of non-proprietary NLP software the standard, and perhaps the best, HMM-
based POS tagger is TnT (Brants, 2000). We argue here that some of the criticism aimed at …

Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines

G Orosz, G Szabó, P Berkecz, Z Szántó… - … Conference on Text …, 2023 - Springer
This paper presents a set of industrial-grade text processing models for Hungarian that
achieve near state-of-the-art performance while balancing resource efficiency and accuracy …

[PDF][PDF] A highly accurate Named Entity corpus for Hungarian

G Szarvas, R Farkas, L Felföldi, A Kocsor, J Csirik - annotation, 2006 - Citeseer
A highly accurate Named Entity (NE) corpus for Hungarian that is publicly available for
research purposes is introduced in the paper, along with its main properties. The results of …

Morphological and syntactic case in statistical dependency parsing

W Seeker, J Kuhn - Computational Linguistics, 2013 - direct.mit.edu
Most morphologically rich languages with free word order use case systems to mark the
grammatical function of nominal elements, especially for the core argument functions of a …

[PDF][PDF] Web-based frequency dictionaries for medium density languages

A Kornai, P Halácsy, V Nagy, C Oravecz… - Proceedings of the …, 2006 - aclanthology.org
Frequency dictionaries play an important role both in psycholinguistic experiment design
and in language technology. The paper describes a new, freely available, web-based …

Context-aware correction of spelling errors in Hungarian medical documents

B Siklósi, A Novák, G Prószéky - Computer Speech & Language, 2016 - Elsevier
Owing to the growing need of acquiring medical data from clinical records, processing such
documents is an important topic in natural language processing (NLP). However, for general …