Title | Authors | Venue | Cited by | Year |
Transparent human evaluation for image captioning | J Kasai, K Sakaguchi, L Dunagan, J Morrison, RL Bras, Y Choi, NA Smith | arXiv preprint arXiv:2111.08940, 2021 | 52 | 2021 |
Bidimensional leaderboards: Generate and evaluate language hand in hand | J Kasai, K Sakaguchi, RL Bras, L Dunagan, J Morrison, AR Fabbri, Y Choi, ... | arXiv preprint arXiv:2112.04139, 2021 | 39 | 2021 |
Dallas Card, and David Jurgens. 2023. You don’t need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric … | B Shu, L Zhang, M Choi, L Dunagan, L Logeswaran, M Lee | arXiv preprint arXiv:2311.09718, 2023 | 21 | 2023 |
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments | B Shu, L Zhang, M Choi, L Dunagan, L Logeswaran, M Lee, D Card, ... | arXiv preprint arXiv:2311.09718, 2023 | 5 | 2023 |
Exploring linguistic style matching in online communities: The role of social context and conversation dynamics | A Ananthasubramaniam, H Chen, J Yan, K Alkiek, J Pei, A Seth, ... | arXiv preprint arXiv:2307.02758, 2023 | 2 | 2023 |
Collective Memory and Narrative Cohesion: A Computational Study of Palestinian Refugee Oral Histories in Lebanon | G Awwad, L Dunagan, D Gamba, TN Rayan | arXiv preprint arXiv:2501.13682, 2025 | | 2025 |