MultiLS: A multi-task lexical simplification framework

K North, T Ranasinghe, M Shardlow… - arXiv preprint arXiv …, 2024 - arxiv.org
Lexical Simplification (LS) automatically replaces difficult-to-read words with easier
alternatives while preserving a sentence's original meaning. LS is a precursor to Text …

Advancing Generative AI for Portuguese with Open Decoder Gervásio PT

R Santos, J Silva, L Gomes, J Rodrigues… - arXiv preprint arXiv …, 2024 - arxiv.org
To advance the neural decoding of Portuguese, in this paper we present a fully open
Transformer-based, instruction-tuned decoder model that sets a new state of the art in this …

TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese

NK Corrêa, S Falk, S Fatimah, A Sen… - Machine Learning with …, 2024 - Elsevier
Large language models (LLMs) have significantly advanced natural language processing,
but their progress has yet to be equal across languages. While most LLMs are trained in …

The importance of context for sentiment analysis in dialogues

I Carvalho, HG Oliveira, C Silva - IEEE Access, 2023 - ieeexplore.ieee.org
Sentiment Analysis (SA) can be applied to dialogues to determine the emotional tone
throughout the conversation. This is beneficial for dialogue systems because it may improve …

ptt5-v2: A closer look at continued pretraining of T5 models for the Portuguese language

M Piau, R Lotufo, R Nogueira - Brazilian Conference on Intelligent …, 2024 - Springer
Despite advancements in Natural Language Processing (NLP) and the growing
availability of pretrained models, the English language remains the primary focus of model …

Exploring Portuguese Hate Speech Detection in Low-Resource Settings: Lightly Tuning Encoder Models or In-Context Learning of Large Models?

G Assis, A Amorim, J Carvalho… - Proceedings of the …, 2024 - aclanthology.org
Automatically identifying hate speech is an emerging field driven by the growth of social
media and the consequent amplification of communication. However, this domain faces …

PORTULAN ExtraGLUE datasets and models: Kick-starting a benchmark for the neural processing of Portuguese

T Osório, B Leite, HL Cardoso, L Gomes… - arXiv preprint arXiv …, 2024 - arxiv.org
Leveraging research on the neural modelling of Portuguese, we contribute a collection of
datasets for an array of language processing tasks and a corresponding collection of fine …

Automatic text readability assessment in European Portuguese

E Ribeiro, N Mamede, J Baptista - Proceedings of the 16th …, 2024 - aclanthology.org
The automatic assessment of text readability and the classification of texts by levels is
essential for language education and language-related industries that rely on effective …

A named entity recognition approach for Portuguese legislative texts using self-learning

RO Nunes, DG Balreira, AS Spritzer… - Proceedings of the …, 2024 - aclanthology.org
Even if technology has made legislative documents more accessible, they are often written
in jargon that makes them hard to understand for ordinary citizens, researchers, journalists …

RoBERTaLexPT: A legal RoBERTa model pretrained with deduplication for Portuguese

EAS Garcia, NFF Silva, F Siqueira… - Proceedings of the …, 2024 - aclanthology.org
This work investigates the application of Natural Language Processing (NLP) in the legal
context for the Portuguese language, emphasizing the importance of adapting pre-trained …