Automatic genre identification: a survey

T Kuzman, N Ljubešić - Language Resources and Evaluation, 2023 - Springer
Automatic genre identification (AGI) is a text classification task focused on genres, i.e., text
categories defined by the author's purpose, common function of the text, and the text's …

Automatic genre identification for robust enrichment of massive text collections: Investigation of classification methods in the era of large language models

T Kuzman, I Mozetič, N Ljubešić - Machine Learning and Knowledge …, 2023 - mdpi.com
Massive text collections are the backbone of large language models, the main ingredient of
the current significant progress in artificial intelligence. However, as these collections are …

Untangling the unrestricted web: Automatic identification of multilingual registers

E Henriksson, A Myntti, A Eskelinen… - ar** question-answer (QA)
datasets by extracting QA pairs from web-scale data using machine learning (ML). Our …

Incorporating Automatically Generated Genre Labels into Neural Machine Translation Systems

M Chichirau - 2024 - fse.studenttheses.ub.rug.nl
State-of-the-art neural machine translation (NMT) systems are often highly specialized for a
certain type of text, referred to as a domain. However, the definition of a domain is still …