Wilds: A benchmark of in-the-wild distribution shifts PW Koh, S Sagawa, H Marklund, SM Xie, M Zhang, A Balsubramani, ... International Conference on Machine Learning, 5637-5664, 2021 | 1565 | 2021 |
Openflamingo: An open-source framework for training large autoregressive vision-language models A Awadalla*, I Gao*, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ... arXiv preprint arXiv:2308.01390, 2023 | 484 | 2023 |
Are aligned neural networks adversarially aligned? N Carlini, M Nasr, CA Choquette-Choo, M Jagielski, I Gao, PWW Koh, ... Advances in Neural Information Processing Systems 36, 2024 | 271 | 2024 |
CREPE: Can Vision-Language Foundation Models Reason Compositionally? Z Ma, J Hong, MO Gul, M Gandhi, I Gao, R Krishna Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 136 | 2023 |
Extending the WILDS benchmark for unsupervised adaptation S Sagawa*, PW Koh*, T Lee*, I Gao*, SM Xie, K Shen, A Kumar, W Hu, ... arXiv preprint arXiv:2112.05090, 2021 | 136 | 2021 |
Adaptive testing of computer vision models I Gao, G Ilharco, S Lundberg, MT Ribeiro Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 40 | 2023 |
Out-of-domain robustness via targeted augmentations I Gao*, S Sagawa*, PW Koh, T Hashimoto, P Liang International Conference on Machine Learning, 10800-10834, 2023 | 23 | 2023 |
Parallel worlds: Repeated initializations of the same team to improve team viability ME Whiting, I Gao, M Xing, NJ Diarrassouba, T Nguyen, MS Bernstein proceedings of the ACM on Human-Computer Interaction 4 (CSCW1), 1-22, 2020 | 20 | 2020 |
Red Teaming Large Language Models in Medicine: Real-World Insights on Model Behavior CTT Chang, H Farah, H Gui, SJ Rezaei, C Bou-Khalil, YJ Park, ... medRxiv, 2024.04. 05.24305411, 2024 | 6 | 2024 |
Model Equality Testing: Which Model Is This API Serving? I Gao, P Liang, C Guestrin International Conference on Learning Representations, 2025 | 2 | 2025 |