Deep learning-based user experience evaluation in distance learning

R Sadigov, E Yıldırım, B Kocaçınar, F Patlar Akbulut… - Cluster …, 2024 - Springer
The Covid-19 pandemic caused uncertainties in many different organizations, institutions
gained experience in remote working and showed that high-quality distance education is a …

Text document clustering: Wordnet vs. TF-IDF vs. word embeddings

M Marcińczuk, M Gniewkowski… - Proceedings of the …, 2021 - aclanthology.org
In the paper, we deal with the problem of unsupervised text document clustering for the
Polish language. Our goal is to compare the modern approaches based on language …

The CLARIN infrastructure as an interoperable language technology platform for SSH and beyond

A Branco, M Eskevich, F Frontini, J Hajič… - Language resources …, 2023 - Springer
CLARIN is a European Research Infrastructure Consortium developing and providing a
federated and interoperable platform to support scientists in the field of the Social Sciences …

Bag-of-words, bag-of-topics and word-to-vec based subject classification of text documents in Polish - a comparative study

T Walkowiak, S Datko, H Maciejewski - … , July 2-6, 2018, Brunów, Poland …, 2019 - Springer
This paper deals with the problem of classification of Polish language documents in terms of
a subject category. We compare four state-of-the-art approaches to this task which differ …

StyloMetrix: An Open-Source Multilingual Tool for Representing Stylometric Vectors

I Okulska, D Stetsenko, A Kołos, A Karlińska… - arxiv preprint arxiv …, 2023 - arxiv.org
This work aims to provide an overview on the open-source multilanguage tool called
StyloMetrix. It offers stylometric text representations that cover various aspects of grammar …

New parallel corpora of Baltic and Slavic languages—Assumptions of corpus construction

M Duszkin, D Roszko, R Roszko - International Conference on Text …, 2021 - Springer
In this article, we describe the design principles of the ten newly published CLARIN-PL
corpora of Slavic and Baltic languages. In relation to other non-commercial online corpora …

Implementation of language models within an infrastructure designed for Natural Language Processing

B Walkowiak, T Walkowiak - International Journal of Electronics and …, 2024 - journals.pan.pl
This paper explores cost-effective alternatives for resource-constrained environments in the
context of language models by investigating methods such as quantization and CPU-based …

Towards CLARIN-PL LTC digital research platform for: Depositing, processing, analyzing and visualizing language data

M Pol, T Walkowiak, M Piasecki - … and Communication, RelStat'17, 18-21 …, 2018 - Springer
The paper presents a new functionality of CLARIN-PL Language Technology Centre (LTC).
LTC Platform is developed as a research place for processing, visualizing and depositing …

Feature extraction in subject classification of text documents in Polish

T Walkowiak, S Datko, H Maciejewski - … 2018, Zakopane, Poland, June 3-7 …, 2018 - Springer
In this work we evaluate two different methods for deriving features for a subject
classification of text documents. The first method uses the standard Bag-of-Words (BoW) …

Named entity recognition for Polish

M Marcińczuk, A Wawer - Poznan Studies in Contemporary …, 2019 - degruyter.com
In this article we discuss the current state-of-the-art for named entity recognition for Polish.
We present publicly available resources and open-source tools for named entity recognition …