QAPyramid: Fine-grained Evaluation of Content Selection for Text Summarization

S Zhang, D Wan, A Cattan, A Klein, I Dagan… - arXiv preprint arXiv …, 2024 - arxiv.org
How to properly conduct human evaluations for text summarization is a longstanding
challenge. The Pyramid human evaluation protocol, which assesses content selection by …