SummaC: Re-visiting NLI-based models for inconsistency detection in summarization P Laban, T Schnabel, PN Bennett, MA Hearst Transactions of the Association for Computational Linguistics 10, 163-177, 2022 | 354 | 2022 |
Art or artifice? Large language models and the false promise of creativity T Chakrabarty, P Laban, D Agarwal, S Muresan, CS Wu Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems …, 2024 | 102 | 2024 |
Understanding factual errors in summarization: Errors, summarizers, datasets, error detectors L Tang, T Goyal, AR Fabbri, P Laban, J Xu, S Yavuz, W Kryściński, ... arXiv preprint arXiv:2205.12854, 2022 | 82 | 2022 |
Keep it simple: Unsupervised simplification of multi-paragraph text P Laban, T Schnabel, P Bennett, MA Hearst arXiv preprint arXiv:2107.03444, 2021 | 66 | 2021 |
The summary loop: Learning to write abstractive summaries without examples P Laban, A Hsi, J Canny, MA Hearst arXiv preprint arXiv:2105.05361, 2021 | 66 | 2021 |
MixQG: Neural question generation with mixed answer types L Murakhovs'ka, CS Wu, P Laban, T Niu, W Liu, C Xiong arXiv preprint arXiv:2110.08175, 2021 | 56 | 2021 |
MiniCheck: Efficient fact-checking of LLMs on grounding documents L Tang, P Laban, G Durrett arXiv preprint arXiv:2404.10774, 2024 | 53 | 2024 |
SummEdits: Measuring LLM ability at factual reasoning through the lens of summarization P Laban, W Kryściński, D Agarwal, AR Fabbri, C Xiong, S Joty, CS Wu Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | 43 | 2023 |
Can transformer models measure coherence in text? Re-thinking the shuffle test P Laban, L Dai, L Bandarkar, MA Hearst arXiv preprint arXiv:2107.03448, 2021 | 38 | 2021 |
newsLens: building and visualizing long-ranging news stories P Laban, MA Hearst Proceedings of the Events and Stories in the News Workshop, 1-9, 2017 | 36 | 2017 |
LLMs as factual reasoners: Insights from existing benchmarks and beyond P Laban, W Kryściński, D Agarwal, AR Fabbri, C Xiong, S Joty, CS Wu arXiv preprint arXiv:2305.14540, 2023 | 35 | 2023 |
XGen-7B technical report E Nijkamp, T Xie, H Hayashi, B Pang, C Xia, C Xing, J Vig, S Yavuz, ... arXiv preprint arXiv:2309.03450, 2023 | 30 | 2023 |
Did you read the instructions? Rethinking the effectiveness of task definitions in instruction learning F Yin, J Vig, P Laban, S Joty, C Xiong, CSJ Wu arXiv preprint arXiv:2306.01150, 2023 | 30 | 2023 |
What's the latest? A question-driven news chatbot P Laban, J Canny, MA Hearst arXiv preprint arXiv:2105.05392, 2021 | 29 | 2021 |
Embrace divergence for richer insights: A multi-document summarization benchmark and a case study on summarizing diverse information from news articles KH Huang, P Laban, AR Fabbri, PK Choubey, S Joty, C Xiong, CS Wu arXiv preprint arXiv:2309.09369, 2023 | 25 | 2023 |
Quiz design task: Helping teachers create quizzes with automated question generation P Laban, CS Wu, L Murakhovs'ka, W Liu, C Xiong arXiv preprint arXiv:2205.01730, 2022 | 25 | 2022 |
Summary of a haystack: A challenge to long-context LLMs and RAG systems P Laban, AR Fabbri, C Xiong, CS Wu arXiv preprint arXiv:2407.01370, 2024 | 23 | 2024 |
Are you sure? Challenging LLMs leads to performance drops in the FlipFlop experiment P Laban, L Murakhovs'ka, C Xiong, CS Wu arXiv preprint arXiv:2311.08596, 2023 | 21 | 2023 |
News headline grouping as a challenging NLU task P Laban, L Bandarkar, MA Hearst arXiv preprint arXiv:2105.05391, 2021 | 17 | 2021 |
NewsPod: Automatic and interactive news podcasts P Laban, E Ye, S Korlakunta, J Canny, M Hearst Proceedings of the 27th International Conference on Intelligent User …, 2022 | 16 | 2022 |