The FRENK datasets of socially unacceptable discourse in Slovene and English

N Ljubešić, D Fišer, T Erjavec - … , September 11–13, 2019, Proceedings 22, 2019 - Springer
In this paper we present datasets of Facebook comment threads to mainstream media posts
in Slovene and English developed inside the Slovene national project FRENK (the acronym …

Can cross-domain term extraction benefit from cross-lingual transfer and nested term labeling?

HTH Tran, M Martinc, A Repar, N Ljubešić, A Doucet… - Machine Learning, 2024 - Springer
Automatic term extraction (ATE) is a natural language processing task that eases the effort of
manually identifying terms from domain-specific corpora by providing a list of candidate …

Can cross-domain term extraction benefit from cross-lingual transfer?

HTH Tran, M Martinc, A Doucet, S Pollak - International Conference on …, 2022 - Springer
Automatic term extraction (ATE) is a natural language processing task that eases the effort of
manually identifying terms from domain-specific corpora by providing a list of candidate …

Ensembling transformers for cross-domain automatic term extraction

HTH Tran, M Martinc, A Pelicon, A Doucet… - … Conference on Asian …, 2022 - Springer
Automatic term extraction plays an essential role in domain language understanding and
several natural language processing downstream tasks. In this paper, we propose a …

[PDF][PDF] Better web corpora for corpus linguistics and NLP

V Suchomel - Masaryk University, 2020 - is.muni.cz
The internet is used by computational linguists, lexicographers and social scientists as an
immensely large source of text data for various NLP tasks and language studies. Web …