Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[PDF][PDF] Creating the european literary text collection (eltec): Challenges and perspectives
The aim of this contribution is to reflect on the process of building the multilingual European
Literary Text Collection (ELTeC) that is being created in the framework of the networking …
Literary Text Collection (ELTeC) that is being created in the framework of the networking …
[PDF][PDF] Universal dependency annotation for multilingual parsing
We present a new collection of treebanks with homogeneous syntactic dependency
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …
annotation for six languages: German, English, Swedish, Spanish, French and Korean. To …
What does neural bring? analysing improvements in morphosyntactic annotation and lemmatisation of Slovenian, Croatian and Serbian
We present experiments on Slovenian, Croatian and Serbian morphosyntactic annotation
and lemmatisation between the former state-of-the-art for these three languages and one of …
and lemmatisation between the former state-of-the-art for these three languages and one of …
CLASSLA-Stanza: The next step for linguistic processing of South Slavic languages
We present CLASSLA-Stanza, a pipeline for automatic linguistic annotation of the South
Slavic languages, which is based on the Stanza natural language processing pipeline. We …
Slavic languages, which is based on the Stanza natural language processing pipeline. We …
[PDF][PDF] The reference corpus of the contemporary Romanian language (CoRoLa)
We present here the largest publicly available corpus of Romanian. Its written component
contains 1,257,752,812 tokens, distributed, in an unbalanced way, in several language …
contains 1,257,752,812 tokens, distributed, in an unbalanced way, in several language …
A tiered CRF tagger for Polish
A Radziszewski - Intelligent tools for building a scientific information …, 2013 - Springer
In this paper we present a new approach to morphosyntactic tagging of Polish by bringing
together Conditional Random Fields and tiered tagging. Our proposal also allows to take …
together Conditional Random Fields and tiered tagging. Our proposal also allows to take …
[PDF][PDF] Lemmatization and morphosyntactic tagging of Croatian and Serbian
We investigate state-of-the-art statistical models for lemmatization and morphosyntactic
tagging of Croatian and Serbian. The models stem from a new manually annotated …
tagging of Croatian and Serbian. The models stem from a new manually annotated …
The Janes project: language resources and tools for Slovene user generated content
The paper presents the results of the Janes project, which aimed to develop language
resources and tools for Slovene user generated content. The paper first describes the 200 …
resources and tools for Slovene user generated content. The paper first describes the 200 …
[PDF][PDF] Little strokes fell great oaks: Creating CoRoLa, the reference corpus of contemporary Romanian
The paper presents the quite long-standing tradition of Romanian corpus acquisition and
processing, which reaches its peak with the reference corpus of contemporary Romanian …
processing, which reaches its peak with the reference corpus of contemporary Romanian …
[BOG][B] Multilayer corpus studies
A Zeldes - 2018 - taylorfrancis.com
This volume explores the opportunities afforded by the construction and evaluation of
multilayer corpora, an emerging methodology within corpus linguistics that brings about …
multilayer corpora, an emerging methodology within corpus linguistics that brings about …