Amelia Hardy
Master's Student in Computer Science, Stanford University
Verified email at stanford.edu
Title
Cited by
Year
Evaluating human-language model interaction
M Lee, M Srivastava, A Hardy, J Thickstun, E Durmus, A Paranjape, ...
arXiv preprint arXiv:2212.09746, 2022
Cited by: 120; Year: 2022
Neural generation meets real people: Towards emotionally engaging mixed-initiative conversations
A Paranjape, A See, K Kenealy, H Li, A Hardy, P Qi, KR Sadagopan, ...
arXiv preprint arXiv:2008.12348, 2020
Cited by: 51; Year: 2020
Neural generation meets real people: Building a social, informative open-domain dialogue agent
EA Chi, A Paranjape, A See, C Chiam, T Chang, K Kenealy, SK Lim, ...
arXiv preprint arXiv:2207.12021, 2022
Cited by: 12; Year: 2022
Effective social chatbot strategies for increasing user initiative
A Hardy, A Paranjape, CD Manning
Proceedings of the 22nd annual meeting of the special interest group on …, 2021
Cited by: 12; Year: 2021
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
A Reuel, A Hardy, C Smith, M Lamparth, M Hardy, MJ Kochenderfer
arXiv preprint arXiv:2411.12990, 2024
Cited by: 7; Year: 2024
Evaluating human-language model interaction
M Lee, M Srivastava, A Hardy, J Thickstun, E Durmus, A Paranjape, ...
arXiv preprint arXiv:2212.09746, 2022
Cited by: 5; Year: 2022
More than Marketing? On the Information Value of AI Benchmarks for Practitioners
A Hardy, A Reuel, KJ Meimandi, L Soder, A Griffith, DM Asmar, S Koyejo, ...
arXiv preprint arXiv:2412.05520, 2024
Cited by: 2; Year: 2024
BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices
A Reuel-Lamparth, A Hardy, C Smith, M Lamparth, M Hardy, ...
Advances in Neural Information Processing Systems 37, 21763-21813, 2025
Year: 2025
ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts
AF Hardy, H Liu, B Lange, MJ Kochenderfer
arXiv preprint arXiv:2407.09447, 2024
Year: 2024
ICLR 2025 Workshop on Human-AI Coevolution
JR Anthis, D Asmar, KR Driggs-Campbell, A Hardy, KJ Meimandi, ...
ICLR 2025 Workshop Proposals