Fantastic LLMs for Preference Data Annotation and How to (not) Find Them

G Xu, K Xu, S Sudalairaj, H Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Preference tuning of large language models (LLMs) relies on high-quality human preference
data, which is often expensive and time-consuming to gather. While existing methods can …