SemEval-2024 task 2: Safe biomedical natural language inference for clinical trials

M Jullien, M Valentino, A Freitas - arxiv preprint arxiv:2404.04963, 2024 - arxiv.org
Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in
dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs …

Factcheck-bench: Fine-grained evaluation benchmark for automatic fact-checkers

Y Wang, RG Reddy, Z Mujahid, A Arora… - Findings of the …, 2024 - aclanthology.org
The increased use of large language models (LLMs) across a variety of real-world
applications calls for mechanisms to verify the factual accuracy of their outputs. In this work …

Saama AI Research at SemEval-2023 Task 7: Exploring the Capabilities of Flan-T5 for Multi-evidence Natural Language Inference in Clinical Trial Data

KR Kanakarajan, M Sankarasubbu - Proceedings of the 17th …, 2023 - aclanthology.org
The goal of the NLI4CT task is to build a Natural Language Inference system for Clinical
Trial Reports that will be used for evidence interpretation and retrieval. Large Language …

T5-medical at semeval-2024 task 2: Using t5 medical embedding for natural language inference on clinical trial data

M Siino - Proceedings of the 18th International Workshop on …, 2024 - aclanthology.org
In this work, we address the challenge of identifying the inference relation between a plain
language statement and Clinical Trial Reports (CTRs) by using a T5-large model …

MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets.

R Van Der Goot - Proceedings of the 17th International Workshop …, 2023 - aclanthology.org
To improve the ability of language models to handle Natural Language Processing (NLP)
tasks and intermediate step of pre-training has recently beenintroduced. In this setup, one …

Sebis at SemEval-2023 task 7: A joint system for natural language inference and evidence retrieval from clinical trial reports

J Vladika, F Matthes - arxiv preprint arxiv:2304.13180, 2023 - arxiv.org
With the increasing number of clinical trial reports generated every day, it is becoming hard
to keep up with novel discoveries that inform evidence-based healthcare recommendations …

Knowcomp at semeval-2023 task 7: Fine-tuning pre-trained language models for clinical trial entailment identification

W Wang, B Xu, T Fang, L Zhang… - Proceedings of the 17th …, 2023 - aclanthology.org
In this paper, we present our system for the textual entailment identification task as a subtask
of the SemEval-2023 Task 7: Multi-evidence Natural Language Inference for Clinical Trial …

Semeval-2024 task 7: Numeral-aware language understanding and generation

CC Chen, JT Huang, HH Huang… - Proceedings of the …, 2024 - aclanthology.org
Numbers are frequently utilized in both our daily narratives and professional documents,
such as clinical notes, scientific papers, financial documents, and legal court orders. The …

[HTML][HTML] Patients' selection and trial matching in early-phase oncology clinical trials

P Corbaux, A Bayle, S Besle, A Vinceneux… - Critical Reviews in …, 2024 - Elsevier
Background Early-phase clinical trials (EPCT) represent an important part of innovations in
medical oncology and a valuable therapeutic option for patients with metastatic cancers …

Large language models, scientific knowledge and factuality: A systematic analysis in antibiotic discovery

M Wysocka, O Wysocki, M Delmas, V Mutel… - arxiv preprint arxiv …, 2023 - arxiv.org
Inferring over and extracting information from Large Language Models (LLMs) trained on a
large corpus of scientific literature can potentially drive a new era in biomedical research …