Evaluating human-language model interaction M Lee, M Srivastava, A Hardy, J Thickstun, E Durmus, A Paranjape, ... arXiv preprint arXiv:2212.09746, 2022 | 120 | 2022 |
Neural generation meets real people: Towards emotionally engaging mixed-initiative conversations A Paranjape, A See, K Kenealy, H Li, A Hardy, P Qi, KR Sadagopan, ... arXiv preprint arXiv:2008.12348, 2020 | 51 | 2020 |
Neural generation meets real people: Building a social, informative open-domain dialogue agent EA Chi, A Paranjape, A See, C Chiam, T Chang, K Kenealy, SK Lim, ... arXiv preprint arXiv:2207.12021, 2022 | 12 | 2022 |
Effective social chatbot strategies for increasing user initiative A Hardy, A Paranjape, CD Manning Proceedings of the 22nd annual meeting of the special interest group on …, 2021 | 12 | 2021 |
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices A Reuel, A Hardy, C Smith, M Lamparth, M Hardy, MJ Kochenderfer arXiv preprint arXiv:2411.12990, 2024 | 7 | 2024 |
Evaluating human-language model interaction M Lee, M Srivastava, A Hardy, J Thickstun, E Durmus, A Paranjape, ... arXiv preprint arXiv:2212.09746, 2022 | 5 | 2022 |
More than Marketing? On the Information Value of AI Benchmarks for Practitioners A Hardy, A Reuel, KJ Meimandi, L Soder, A Griffith, DM Asmar, S Koyejo, ... arXiv preprint arXiv:2412.05520, 2024 | 2 | 2024 |
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices A Reuel-Lamparth, A Hardy, C Smith, M Lamparth, M Hardy, ... Advances in Neural Information Processing Systems 37, 21763-21813, 2025 | | 2025 |
ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts AF Hardy, H Liu, B Lange, MJ Kochenderfer arXiv preprint arXiv:2407.09447, 2024 | | 2024 |
ICLR 2025 Workshop on Human-AI Coevolution JR Anthis, D Asmar, KR Driggs-Campbell, A Hardy, KJ Meimandi, ... ICLR 2025 Workshop Proposals | | |