Internal consistency and self-feedback in large language models: A survey

X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li… - ar** into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
R Zhu, Z Ma, J Wu, J Gao, J Wang, D Lin… - arxiv preprint arxiv …, 2024 - arxiv.org
Refusal-Aware Instruction Tuning (RAIT) enables Large Language Models (LLMs) to refuse
to answer unknown questions. By modifying responses of unknown questions in the training …