Digitising Swiss German: how to process and study a polycentric spoken language

Y Scherrer, T Samardžić, E Glaser - Language Resources and Evaluation, 2019 - Springer
Swiss dialects of German are, unlike many dialects of other standardised languages, widely
used in everyday communication. Despite this fact, automatic processing of Swiss German is …

Abstractive Summarization of Historical Documents: A New Dataset and Novel Method using a Domain-Specific Pretrained Model

K Murugaraj, S Lamsiyah, C Schommer - IEEE Access, 2025 - ieeexplore.ieee.org
Automatic Text Summarization (ATS) systems aim to generate concise summaries of
documents while preserving their essential aspects using either extractive or abstractive …

Words divide, pictographs unite: Pictograph communication technologies for people with an intellectual disability

L Sevens - 2018 - lirias.kuleuven.be
In order to improve the accessibility of the Internet for users with reading and writing
disabilities, we develop a set of tools that automatically translate Dutch natural language text …

Historical German text normalization using type-and token-based language modeling

A Ehrmanntraut - arXiv preprint arXiv:2409.02841, 2024 - arxiv.org
Historic variations of spelling pose a challenge for full-text search or natural language
processing on historical digitized texts. To minimize the gap between the historic …

Interactive machine translation for the language modernization and spelling normalization of historical documents

M Domingo, F Casacuberta - Pattern Analysis and Applications, 2023 - Springer
Historical documents are an important part of our cultural heritage. Among other tasks related
to their processing, it is important to modernize their language in order to make them …

Nähetexte automatisch erkennen: Entwicklung eines linguistischen Scores für konzeptionelle Mündlichkeit in historischen Texten

K Ortmann, S Dipper - Sprechen und Gespräch in historischer Perspektive …, 2024 - Springer
This contribution presents an automatically computable score for assessing the
conceptual orality of a historical text. The score is based on a …

How to tag non-standard language: Normalisation versus domain adaptation for Slovene historical and user-generated texts

K Zupan, N Ljubešić, T Erjavec - Natural Language Engineering, 2019 - cambridge.org
Part-of-speech (PoS) tagging of non-standard language with models developed for standard
language is known to suffer from a significant decrease in accuracy. Two methods are …