Data-driven sentence simplification: Survey and benchmark
F Alva-Manchego, C Scarton, L Specia - Computational Linguistics, 2020 - direct.mit.edu
Sentence Simplification (SS) aims to modify a sentence in order to make it easier to read
and understand. In order to do so, several rewriting transformations can be performed such …
and understand. In order to do so, several rewriting transformations can be performed such …
Corpora generation for grammatical error correction
Grammatical Error Correction (GEC) has been recently modeled using the sequence-to-
sequence framework. However, unlike sequence transduction problems such as machine …
sequence framework. However, unlike sequence transduction problems such as machine …
MUSS: Multilingual unsupervised sentence simplification by mining paraphrases
Progress in sentence simplification has been hindered by a lack of labeled parallel
simplification data, particularly in languages other than English. We introduce MUSS, a …
simplification data, particularly in languages other than English. We introduce MUSS, a …
Revisiting non-English text simplification: A unified multilingual benchmark
Recent advancements in high-quality, large-scale English resources have pushed the
frontier of English Automatic Text Simplification (ATS) research. However, less work has …
frontier of English Automatic Text Simplification (ATS) research. However, less work has …
Learning to split and rephrase from Wikipedia edit history
Split and rephrase is the task of breaking down a sentence into shorter ones that together
convey the same meaning. We extract a rich new dataset for this task by mining Wikipedia's …
convey the same meaning. We extract a rich new dataset for this task by mining Wikipedia's …
Multilingual unsupervised sentence simplification
Progress in Sentence Simplification has been hindered by the lack of supervised data,
particularly in languages other than English. Previous work has aligned sentences from …
particularly in languages other than English. Previous work has aligned sentences from …
LexFit: Lexical fine-tuning of pretrained language models
Transformer-based language models (LMs) pretrained on large text collections implicitly
store a wealth of lexical semantic knowledge, but it is non-trivial to extract that knowledge …
store a wealth of lexical semantic knowledge, but it is non-trivial to extract that knowledge …
Neural readability pairwise ranking for sentences in Italian administrative language
M Miliani, S Auriemma… - Proceedings of the …, 2022 - aclanthology.org
Abstract Automatic Readability Assessment aims at assigning a complexity level to a given
text, which could help improve the accessibility to information in specific domains, such as …
text, which could help improve the accessibility to information in specific domains, such as …
Gemv2: Multilingual nlg benchmarking in a single line of code
Evaluation in machine learning is usually informed by past choices, for example which
datasets or metrics to use. This standardization enables the comparison on equal footing …
datasets or metrics to use. This standardization enables the comparison on equal footing …
Linguistically-based comparison of different approaches to building corpora for text simplification: A case study on italian
In this paper, we present an overview of existing parallel corpora for Automatic Text
Simplification (ATS) in different languages focusing on the approach adopted for their …
Simplification (ATS) in different languages focusing on the approach adopted for their …