Suchin Gururangan

Citeret af

	Alle	Siden 2020
Henvisninger	9884	9674
h-index	21	21
i10-indeks	27	25

4500

2250

1125

3375

2018201920202021202220232024202553 132 413 853 1247 1743 4432 966

Offentlig adgang

Se alle

3 artikler

0 artikler

tilgængelige

ikke tilgængelige

Baseret på krav i forbindelse med finansiering

Medforfattere

Noah A. SmithUniversity of Washington; Allen Institute for Artificial IntelligenceVerificeret mail på cs.washington.edu
Luke ZettlemoyerUniversity of Washington; MetaVerificeret mail på cs.washington.edu
Swabha SwayamdiptaUniversity of Southern CaliforniaVerificeret mail på usc.edu
Roy SchwartzAssociate Professor, the School of Computer Science, the Hebrew University of JerusalemVerificeret mail på mail.huji.ac.il
Dallas CardUniversity of MichiganVerificeret mail på umich.edu
Mike LewisFacebook AI ResearchVerificeret mail på fb.com
Samuel R. BowmanAnthropic and NYUVerificeret mail på anthropic.com
Omer LevyGoogle DeepMindVerificeret mail på google.com

Følg

Suchin Gururangan

University of Washington

Verificeret mail på cs.washington.edu - Startside

Natural Language Processing Machine Learning


Titel Sortér efter henvisninger Sortér efter årstal Sortér efter titel	Citeret af Citeret af	År
Don't stop pretraining: Adapt language models to domains and tasks S Gururangan, A Marasović, S Swayamdipta, K Lo, I Beltagy, D Downey, ... arXiv preprint arXiv:2004.10964, 2020	2533	2020
The llama 3 herd of models A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, A Letman, A Mathur, ... arXiv preprint arXiv:2407.21783, 2024	2493	2024
Annotation artifacts in natural language inference data S Gururangan, S Swayamdipta, O Levy, R Schwartz, SR Bowman, ... arXiv preprint arXiv:1803.02324, 2018	1291	2018
Realtoxicityprompts: Evaluating neural toxic degeneration in language models S Gehman, S Gururangan, M Sap, Y Choi, NA Smith arXiv preprint arXiv:2009.11462, 2020	1145	2020
Editing models with task arithmetic G Ilharco, MT Ribeiro, M Wortsman, S Gururangan, L Schmidt, ... arXiv preprint arXiv:2212.04089, 2022	448	2022
All that's' human'is not gold: Evaluating human evaluation of generated text E Clark, T August, S Serrano, N Haduong, S Gururangan, NA Smith arXiv preprint arXiv:2107.00061, 2021	422	2021
Show your work: Improved reporting of experimental results J Dodge, S Gururangan, D Card, R Schwartz, NA Smith arXiv preprint arXiv:1909.03004, 2019	284	2019
Branch-train-merge: Embarrassingly parallel training of expert language models M Li, S Gururangan, T Dettmers, M Lewis, T Althoff, NA Smith, ... arXiv preprint arXiv:2208.03306, 2022	144	2022
Variational pretraining for semi-supervised text classification S Gururangan, T Dang, D Card, NA Smith arXiv preprint arXiv:1906.02242, 2019	143	2019
Detoxifying language models risks marginalizing minority voices A Xu, E Pathak, E Wallace, S Gururangan, M Sap, D Klein arXiv preprint arXiv:2104.06390, 2021	137	2021
Less: Selecting influential data for targeted instruction tuning M Xia, S Malladi, S Gururangan, S Arora, D Chen arXiv preprint arXiv:2402.04333, 2024	132	2024
Demix layers: Disentangling domains for modular language modeling S Gururangan, M Lewis, A Holtzman, NA Smith, L Zettlemoyer arXiv preprint arXiv:2108.05036, 2021	117	2021
Time waits for no one! analysis and challenges of temporal misalignment K Luu, D Khashabi, S Gururangan, K Mandyam, NA Smith arXiv preprint arXiv:2111.07408, 2021	83	2021
The llama 3 herd of models A Grattafiori, A Dubey, A Jauhri, A Pandey, A Kadian, A Al-Dahle, ... arXiv e-prints, arXiv: 2407.21783, 2024	79	2024
Osworld: Benchmarking multimodal agents for open-ended tasks in real computer environments T Xie, D Zhang, J Chen, X Li, S Zhao, R Cao, JH Toh, Z Cheng, D Shin, ... Advances in Neural Information Processing Systems 37, 52040-52094, 2025	68	2025
Silo language models: Isolating legal risk in a nonparametric datastore S Min, S Gururangan, E Wallace, W Shi, H Hajishirzi, NA Smith, ... arXiv preprint arXiv:2308.04430, 2023	62	2023
kNN-Prompt: Nearest Neighbor Zero-Shot Inference W Shi, J Michael, S Gururangan, L Zettlemoyer arXiv preprint arXiv:2205.13792, 2022	51	2022
Datacomp-lm: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024	43	2024
Scaling expert language models with unsupervised domain discovery S Gururangan, M Li, M Lewis, W Shi, T Althoff, NA Smith, L Zettlemoyer arXiv preprint arXiv:2303.14177, 2023	43	2023
Whose language counts as high quality? measuring language ideologies in text data selection S Gururangan, D Card, SK Dreier, EK Gade, LZ Wang, Z Wang, ... arXiv preprint arXiv:2201.10474, 2022	28	2022

Systemet kan ikke foretage handlingen nu. Prøv igen senere.

Artikler 1–20

Henvisninger pr. år

Dublerede henvisninger

Flettede henvisninger

Tilføj medforfattereMedforfattere

Følg

Citeret af

Medforfattere