Kee** humans in the loop: Human-centered automated annotation with generative ai

N Pangakis, S Wolken - arxiv preprint arxiv:2409.09467, 2024 - arxiv.org
Automated text annotation is a compelling use case for generative large language models
(LLMs) in social media research. Recent work suggests that LLMs can achieve strong …

Ethics Whitepaper: Whitepaper on Ethical Research into Large Language Models

EL Ungless, N Vitsakis, Z Talat, J Garforth… - arxiv preprint arxiv …, 2024 - arxiv.org
This whitepaper offers an overview of the ethical considerations surrounding research into
or with large language models (LLMs). As LLMs become more integrated into widely used …

A Little Human Data Goes A Long Way

D Ashok, J May - arxiv preprint arxiv:2410.13098, 2024 - arxiv.org
Faced with an expensive human annotation process, creators of NLP systems increasingly
turn to synthetic data generation. While this method shows promise, the extent to which …

Data advisor: Dynamic data curation for safety alignment of large language models

F Wang, N Mehrabi, P Goyal, R Gupta… - arxiv preprint arxiv …, 2024 - arxiv.org
Data is a crucial element in large language model (LLM) alignment. Recent studies have
explored using LLMs for efficient data collection. However, LLM-generated data often suffers …