MULFE: A Multi-Level Benchmark for Free Text Model Editing

C Wang, P Cao, Z **, Y Chen, D Zeng… - Proceedings of the …, 2024 - aclanthology.org
Adjusting the outdated behaviors of large langugae models (LLMs) after deployment
remains a significant challenge. It motivates the model editing research, which is however …

Pacuna: Automated fine-tuning of language models for particle accelerators

A Sulc, R Kammering, A Eichler, T Wilksen - arxiv preprint arxiv …, 2023 - arxiv.org
Navigating the landscape of particle accelerators has become increasingly challenging with
recent surges in contributions. These intricate devices challenge comprehension, even …

An Efficient Multilingual Language Model Compression through Vocabulary Trimming

A Ushio, Y Zhou, J Camacho-Collados - arxiv preprint arxiv:2305.15020, 2023 - arxiv.org
Multilingual language model (LM) have become a powerful tool in NLP especially for non-
English languages. Nevertheless, model parameters of multilingual LMs remain large due to …

HintEval: A Comprehensive Framework for Hint Generation and Evaluation for Questions

J Mozafari, B Piryani, A Abdallah, A Jatowt - arxiv preprint arxiv …, 2025 - arxiv.org
Large Language Models (LLMs) are transforming how people find information, and many
users turn nowadays to chatbots to obtain answers to their questions. Despite the instant …

ISQA: Informative Factuality Feedback for Scientific Summarization

Z Li, Y Qin, Q Liu, MY Kan - arxiv preprint arxiv:2404.13246, 2024 - arxiv.org
We propose Iterative Facuality Refining on Informative Scientific Question-Answering (ISQA)
feedback\footnote {Code is available at\url {https://github. com/lizekai-richard/isqa}}, a …

Once Upon a Replication: It is Humans' Turn to Evaluate AI's Understanding of Children's Stories for QA Generation

AM Florescu, M Micluta-Campeanu… - Proceedings of the …, 2024 - aclanthology.org
The following paper presents the outcomes of a collaborative experiment on human
evaluation from the ReproNLP 2024 shared task, track B, part of the ReproHum project. For …

Towards Vietnamese Question and Answer Generation: An Empirical Study

QH Pham, HL Le, M Dang Nhat, K Tran T… - ACM Transactions on …, 2024 - dl.acm.org
Question-answer generation (QAG) is a challenging task that generates both questions and
answers from a given input paragraph context. The QAG task has recently achieved …

Automatic Multilingual Question Generation for Health Data Using LLMs

R Ackerman, R Balyan - International Conference on AI-generated …, 2023 - Springer
Question Generation (QG) involves automatic generation of yes/no, factual and Wh-
questions created from data sources such as a database, raw text or semantic …