The dark side of the language: Pre-trained transformers in the darknet

L Ranaldi, A Nourbakhsh, A Patrizi, ES Ruzzetti… - arxiv preprint arxiv …, 2022 - arxiv.org
Pre-trained Transformers are challenging human performances in many NLP tasks. The
massive datasets used for pre-training seem to be the key to their success on existing tasks …

Can Triplet Loss Be Used for Multi-Label Few-Shot Classification? A Case Study

GM Csányi, R Vági, A Megyeri, A Fülöp, D Nagy… - Information, 2023 - mdpi.com
Few-shot learning is a deep learning subfield that is the focus of research nowadays. This
paper addresses the research question of whether a triplet-trained Siamese network, initially …

Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL Translation

F Ranaldi, ES Ruzzetti, D Onorati, L Ranaldi… - arxiv preprint arxiv …, 2024 - arxiv.org
Understanding textual description to generate code seems to be an achieved capability of
instruction-following Large Language Models (LLMs) in zero-shot scenario. However, there …

Undesirable Memorization in Large Language Models: A Survey

A Satvaty, S Verberne, F Turkmen - arxiv preprint arxiv:2410.02650, 2024 - arxiv.org
While recent research increasingly showcases the remarkable capabilities of Large
Language Models (LLMs), it's vital to confront their hidden pitfalls. Among these challenges …

Enhancing Data Privacy in Large Language Models through Private Association Editing

D Venditti, ES Ruzzetti, GA Xompero… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) require a significant redesign in solutions to preserve
privacy in data-intensive applications due to their text-generation capabilities. Indeed, LLMs …

[PDF][PDF] Termite Italian Text-to-SQL: A CALAMITA Challenge

F Ranaldi, ES Ruzzetti, D Onorati… - Proceedings of the 10th …, 2024 - ceur-ws.org
Relational databases play an important role in business, science, and beyond. However, the
operability of relational databases is restricted to users familiar with specific languages such …

Do LLMs Dream of Ontologies?

M Bombieri, P Fiorini, SP Ponzetto… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have recently revolutionized automated text understanding
and generation. The performance of these models relies on the high number of parameters …

Prompting is not all you need Evaluating GPT-4 performance on a real-world ontology alignment use case

G Macilenti, A Stellato, M Fiorelli - Procedia Computer Science, 2024 - Elsevier
Ontology Alignment (OA) is a complex, demanding and error-prone task, requiring the
intervention of domain and Semantic Web experts. Automating the alignment process thus …

Are All Languages Equal? Curriculum Learning over Different Languages

G Pucci¹, L Ranaldi, FM Zanzotto¹ - Proceedings of the 9th Italian …, 2024 - books.google.com
Curriculum Learning (CL) is emerging as a relevant technique to reduce the cost of pre-
training Large Language Models. The idea, tested for the English language, is to train LLMs …

[PDF][PDF] Teasing LLMs Adapted to Italian.

L Ranaldi, G Pucci, ES Ruzzetti, FM Zanzotto, A Freitas - CLiC-it, 2023 - ceur-ws.org
Abstract Instruction-tuned Large Language Models (It-LLMs) are changing NLP thanks to
their easy accessibility. These models seem able to grasp language, solve complex tasks …