Sledovat
Alicia Parrish
Alicia Parrish
Google
E-mailová adresa ověřena na: nyu.edu - Domovská stránka
Název
Citace
Citace
Rok
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
32082023
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
15732023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
13562022
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
11702024
BLiMP: The benchmark of linguistic minimal pairs for English
A Warstadt, A Parrish, H Liu, A Mohananey, W Peng, SF Wang, ...
Transactions of the Association for Computational Linguistics 8, 377-392, 2020
4722020
Gemma 2: Improving open language models at a practical size
G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ...
arXiv preprint arXiv:2408.00118, 2024
4242024
BBQ: A hand-built bias benchmark for question answering
A Parrish, A Chen, N Nangia, V Padmakumar, J Phang, J Thompson, ...
arXiv preprint arXiv:2110.08193, 2021
3312021
Investigating BERT's knowledge of language: five analysis methods with NPIs
A Warstadt, Y Cao, I Grosu, W Peng, H Blix, Y Nie, A Alsop, S Bordia, ...
arXiv preprint arXiv:1909.02597, 2019
1392019
Dataperf: Benchmarks for data-centric ai development
M Mazumder, C Banbury, X Yao, B Karlaš, W Gaviria Rojas, S Diamos, ...
Advances in Neural Information Processing Systems 36, 5320-5347, 2023
1372023
QuALITY: Question answering with long input texts, yes!
RY Pang, A Parrish, N Joshi, N Nangia, J Phang, A Chen, V Padmakumar, ...
arXiv preprint arXiv:2112.08608, 2021
1302021
Inverse scaling: When bigger isn't better
IR McKenzie, A Lyzhov, M Pieler, A Parrish, A Mueller, A Prabhu, ...
arXiv preprint arXiv:2306.09479, 2023
922023
Palm 2 technical report. arXiv 2023
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 0
84
Dices dataset: Diversity in conversational ai evaluation for safety
L Aroyo, A Taylor, M Diaz, C Homan, A Parrish, G Serapio-García, ...
Advances in Neural Information Processing Systems 36, 53330-53342, 2023
422023
Does putting a linguist in the loop improve NLU data collection?
A Parrish, W Huang, O Agha, SH Lee, N Nangia, A Warstadt, K Aggarwal, ...
arXiv preprint arXiv:2104.07179, 2021
412021
What do NLP researchers believe? Results of the NLP community metasurvey
J Michael, A Holtzman, A Parrish, A Mueller, A Wang, A Chen, D Madaan, ...
arXiv preprint arXiv:2208.12852, 2022
352022
Introducing v0. 5 of the ai safety benchmark from mlcommons
B Vidgen, A Agrawal, AM Ahmed, V Akinwande, N Al-Nuaimi, N Alfaraj, ...
arXiv preprint arXiv:2404.12241, 2024
342024
A toolbox for surfacing health equity harms and biases in large language models
SR Pfohl, H Cole-Lewis, R Sayres, D Neal, M Asiedu, A Dieng, ...
Nature Medicine 30 (12), 3590-3600, 2024
302024
Two failures of self-consistency in the multi-step reasoning of LLMs
A Chen, J Phang, A Parrish, V Padmakumar, C Zhao, SR Bowman, K Cho
arXiv preprint arXiv:2305.14279, 2023
292023
NOPE: A corpus of naturally-occurring presuppositions in English
A Parrish, S Schuster, A Warstadt, O Agha, SH Lee, Z Zhao, SR Bowman, ...
arXiv preprint arXiv:2109.06987, 2021
242021
Adversarial nibbler: An open red-teaming method for identifying diverse harms in text-to-image generation
J Quaye, A Parrish, O Inel, C Rastogi, HR Kirk, M Kahng, E Van Liemt, ...
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and …, 2024
22*2024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–20