- Academic Search

C Schöch, T Erjavec, R Patras… - Christof Schöch; Tomaz …, 2021 - comum.rcaap.pt

The aim of this contribution is to reflect on the process of building the multilingual European
Literary Text Collection (ELTeC) that is being created in the framework of the networking …

Cited by 50 · All 10 versions

[PDF] aclanthology.org

[PDF] Universal dependency annotation for multilingual parsing

R McDonald, J Nivre… - Proceedings of the …, 2013 - aclanthology.org

We present a new collection of treebanks with homogeneous syntactic dependency
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …

Cited by 720 · All 11 versions

[PDF] aclanthology.org

What does neural bring? analysing improvements in morphosyntactic annotation and lemmatisation of Slovenian, Croatian and Serbian

N Ljubešić, K Dobrovoljc - Proceedings of the 7th workshop on …, 2019 - aclanthology.org

We present experiments on Slovenian, Croatian and Serbian morphosyntactic annotation
and lemmatisation between the former state-of-the-art for these three languages and one of …

Cited by 78 · All 3 versions

[PDF] arxiv.org

CLASSLA-Stanza: The next step for linguistic processing of South Slavic languages

L Terčon, N Ljubešić - arXiv preprint arXiv:2308.04255, 2023 - arxiv.org

We present CLASSLA-Stanza, a pipeline for automatic linguistic annotation of the South
Slavic languages, which is based on the Stanza natural language processing pipeline. We …

Cited by 16 · All 2 versions

[PDF] aclanthology.org

[PDF] The reference corpus of the contemporary Romanian language (CoRoLa)

VB Mititelu, D Tufiş, E Irimia - Proceedings of the Eleventh …, 2018 - aclanthology.org

We present here the largest publicly available corpus of Romanian. Its written component
contains 1,257,752,812 tokens, distributed, in an unbalanced way, in several language …

Cited by 65 · All 3 versions

A tiered CRF tagger for Polish

A Radziszewski - Intelligent tools for building a scientific information …, 2013 - Springer

In this paper we present a new approach to morphosyntactic tagging of Polish by bringing
together Conditional Random Fields and tiered tagging. Our proposal also allows to take …

Cited by 107 · All 3 versions

[PDF] aclanthology.org

[PDF] Lemmatization and morphosyntactic tagging of Croatian and Serbian

Ž Agić, N Ljubešić, D Merkler - Proceedings of the 4th Biennial …, 2013 - aclanthology.org

We investigate state-of-the-art statistical models for lemmatization and morphosyntactic
tagging of Croatian and Serbian. The models stem from a new manually annotated …

Cited by 78 · All 5 versions

The Janes project: language resources and tools for Slovene user generated content

D Fišer, N Ljubešić, T Erjavec - Language resources and evaluation, 2020 - Springer

The paper presents the results of the Janes project, which aimed to develop language
resources and tools for Slovene user generated content. The paper first describes the 200 …

Cited by 46 · All 4 versions

[PDF] bcu-iasi.ro

[PDF] Little strokes fell great oaks: Creating CoRoLa, the reference corpus of contemporary Romanian

D Tufiș, V Barbu Mititelu, E Irimia, V Păiș, R Ion… - 2019 - dspace.bcu-iasi.ro

The paper presents the quite long-standing tradition of Romanian corpus acquisition and
processing, which reaches its peak with the reference corpus of contemporary Romanian …

Cited by 41 · All 5 versions

[BOOK] Multilayer corpus studies

A Zeldes - 2018 - taylorfrancis.com

This volume explores the opportunities afforded by the construction and evaluation of
multilayer corpora, an emerging methodology within corpus linguistics that brings about …

Cited by 41 · All 3 versions

MULTEXT-East: morphosyntactic resources for Central and Eastern European languages

[PDF] Creating the European Literary Text Collection (ELTeC): Challenges and perspectives
