Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time M Wortsman, G Ilharco, SY Gadre, R Roelofs, R Gontijo-Lopes, ... International conference on machine learning, 23965-23998, 2022 | 935 | 2022 |
Openflamingo: An open-source framework for training large autoregressive vision-language models A Awadalla, I Gao, J Gardner, J Hessel, Y Hanafy, W Zhu, K Marathe, ... arXiv preprint arXiv:2308.01390, 2023 | 523 | 2023 |
Datacomp: In search of the next generation of multimodal datasets SY Gadre, G Ilharco, A Fang, J Hayase, G Smyrnis, T Nguyen, R Marten, ... Advances in Neural Information Processing Systems 36, 2024 | 363 | 2024 |
Objaverse-xl: A universe of 10m+ 3d objects M Deitke, R Liu, M Wallingford, H Ngo, O Michel, A Kusupati, A Fan, ... Advances in Neural Information Processing Systems 36, 2024 | 294 | 2024 |
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation SY Gadre, M Wortsman, G Ilharco, L Schmidt, S Song arXiv preprint arXiv:2203.10421, 2022 | 205* | 2022 |
Multimodal c4: An open, billion-scale corpus of images interleaved with text W Zhu, J Hessel, A Awadalla, SY Gadre, J Dodge, A Fang, Y Yu, ... Advances in Neural Information Processing Systems 36, 2024 | 162 | 2024 |
Patching open-vocabulary models by interpolating weights G Ilharco, M Wortsman, SY Gadre, S Song, H Hajishirzi, S Kornblith, ... Advances in Neural Information Processing Systems 35, 29262-29277, 2022 | 146 | 2022 |
End-user robot programming using mixed reality SY Gadre, E Rosen, G Chien, E Phillips, S Tellex, G Konidaris 2019 International conference on robotics and automation (ICRA), 2707-2713, 2019 | 96 | 2019 |
Datacomp-lm: In search of the next generation of training sets for language models J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ... arXiv preprint arXiv:2406.11794, 2024 | 71* | 2024 |
Improving multimodal datasets with image captioning T Nguyen, SY Gadre, G Ilharco, S Oh, L Schmidt Advances in Neural Information Processing Systems 36, 22047-22069, 2023 | 66 | 2023 |
Continuous scene representations for embodied ai SY Gadre, K Ehsani, S Song, R Mottaghi Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 55 | 2022 |
Act the part: Learning interaction strategies for articulated object part discovery SY Gadre, K Ehsani, S Song Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 46 | 2021 |
Language models scale reliably with over-training and on downstream tasks SY Gadre, G Smyrnis, V Shankar, S Gururangan, M Wortsman, R Shao, ... arXiv preprint arXiv:2403.08540, 2024 | 24 | 2024 |
Structure from Action: Learning Interactions for 3D Articulated Object Structure Discovery N Nie, SY Gadre, K Ehsani, S Song 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023 | 20* | 2023 |
OpenLM: a minimal but performative language modeling (lm) repository, 2023 S Gururangan, M Wortsman, SY Gadre, A Dave, M Kilian, W Shi, J Mercat, ... URL https://github.com/mlfoundations/open_lm/. GitHub repository, 2023 | 11* | 2023 |
Should VLMs be Pre-trained with Image Data? S Keh, J Mercat, SY Gadre, K Arora, I Vasiljevic, B Burchfiel, S Song, ... The Thirteenth International Conference on Learning Representations | | |