Text data augmentation for deep learning

C Shorten, TM Khoshgoftaar, B Furht - Journal of big Data, 2021 - Springer
Abstract Natural Language Processing (NLP) is one of the most captivating applications of
Deep Learning. In this survey, we consider how the Data Augmentation training strategy can …

Cyberbullying detection for low-resource languages and dialects: Review of the state of the art

T Mahmud, M Ptaszynski, J Eronen, F Masui - Information Processing & …, 2023 - Elsevier
The struggle of social media platforms to moderate content in a timely manner, encourages
users to abuse such platforms to spread vulgar or abusive language, which, when …

Galactica: A large language model for science

R Taylor, M Kardas, G Cucurull, T Scialom… - arxiv preprint arxiv …, 2022 - arxiv.org
Information overload is a major obstacle to scientific progress. The explosive growth in
scientific literature and data has made it ever harder to discover useful insights in a large …

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

P Laban, T Schnabel, PN Bennett… - Transactions of the …, 2022 - direct.mit.edu
In the summarization domain, a key requirement for summaries is to be factually consistent
with the input document. Previous work has found that natural language inference (NLI) …

Camels in a changing climate: Enhancing lm adaptation with tulu 2

H Ivison, Y Wang, V Pyatkin, N Lambert… - arxiv preprint arxiv …, 2023 - arxiv.org
Since the release of T\" ULU [Wang et al., 2023b], open resources for instruction tuning have
developed quickly, from better base models to new finetuning techniques. We test and …

Task-aware retrieval with instructions

A Asai, T Schick, P Lewis, X Chen, G Izacard… - arxiv preprint arxiv …, 2022 - arxiv.org
We study the problem of retrieval with instructions, where users of a retrieval system
explicitly describe their intent along with their queries. We aim to develop a general-purpose …

The semantic scholar open data platform

R Kinney, C Anastasiades, R Authur, I Beltagy… - arxiv preprint arxiv …, 2023 - arxiv.org
The volume of scientific output is creating an urgent need for automated tools to help
scientists keep up with developments in their field. Semantic Scholar (S2) is an open data …

Ms2: Multi-document summarization of medical studies

J DeYoung, I Beltagy, M van Zuylen, B Kuehl… - arxiv preprint arxiv …, 2021 - arxiv.org
To assess the effectiveness of any medical intervention, researchers must conduct a time-
intensive and highly manual literature review. NLP systems can help to automate or assist in …

Can we automate scientific reviewing?

W Yuan, P Liu, G Neubig - Journal of Artificial Intelligence Research, 2022 - jair.org
The rapid development of science and technology has been accompanied by an
exponential growth in peer-reviewed scientific publications. At the same time, the review of …

Relatedly: Scaffolding literature reviews with existing related work sections

S Palani, A Naik, D Downey, AX Zhang… - Proceedings of the …, 2023 - dl.acm.org
Scholars who want to research a scientific topic must take time to read, extract meaning, and
identify connections across many papers. As scientific literature grows, this becomes …