Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (240), 1-113, 2023 | 5656 | 2023 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3152 | 2023 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1666 | 2023 |
Program synthesis with large language models J Austin, A Odena, M Nye, M Bosma, H Michalewski, D Dohan, E Jiang, ... arXiv preprint arXiv:2108.07732, 2021 | 1541 | 2021 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1120 | 2024 |
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 998 | 2024 |
Structured denoising diffusion models in discrete state-spaces J Austin, DD Johnson, J Ho, D Tarlow, R Van Den Berg Advances in neural information processing systems 34, 17981-17993, 2021 | 860 | 2021 |
Show your work: Scratchpads for intermediate computation with language models M Nye, AJ Andreassen, G Gur-Ari, H Michalewski, J Austin, D Bieber, ... | 644 | 2021 |
Scaling up models and data with t5x and seqio A Roberts, HW Chung, G Mishra, A Levskaya, J Bradbury, D Andor, ... Journal of Machine Learning Research 24 (377), 1-8, 2023 | 162 | 2023 |
Palm: Scaling language modeling with pathways, 2022 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311, 2022 | 133 | 2022 |
Language model cascades D Dohan, W Xu, A Lewkowycz, J Austin, D Bieber, RG Lopes, Y Wu, ... arXiv preprint arXiv:2207.10342, 2022 | 84 | 2022 |
Titan: A parallel asynchronous library for multi-agent and soft-body robotics using nvidia cuda J Austin, R Corrales-Fatou, S Wyetzner, H Lipson 2020 IEEE International Conference on Robotics and Automation (ICRA), 7754-7760, 2020 | 26 | 2020 |
Measuring the impact of programming language distribution G Orlanski, K Xiao, X Garcia, J Hui, J Howland, J Malmaud, J Austin, ... International Conference on Machine Learning, 26619-26645, 2023 | 24 | 2023 |
Beyond in-place corruption: Insertion and deletion in denoising probabilistic models DD Johnson, J Austin, R Berg, D Tarlow arXiv preprint arXiv:2107.07675, 2021 | 19 | 2021 |
Resolving code review comments with machine learning A Frömmgen, J Austin, P Choy, N Ghelani, L Kharatyan, G Surita, ... Proceedings of the 46th International Conference on Software Engineering …, 2024 | 15 | 2024 |
Large vacuum flux surfaces generated by tilted planar coils JL Li, J Austin, KC Hammond, BY Israeli, FA Volpe Plasma physics and controlled fusion 61 (7), 075005, 2019 | 3 | 2019 |
Tilted Planar Interlinked Coils as a Means of Generating Rotational Transform-Modelling and Experiment. SF Mazhar, F Volpe, R Diaz-Pacheco, K Hammond, B Israeli, J Li, J Mann, ... APS April Meeting Abstracts 2018, F01. 013, 2018 | 1 | 2018 |
Machine-learned models for generating code snippets with predicted placeholders for optimizing software development DDW Johnson, DS Tarlow, M Tabachnyk, MH Rasi, J Austin, ... US Patent App. 18/618,371, 2024 | | 2024 |
Machine-learned models for generating code snippets with predicted placeholders for optimizing software development DDW Johnson, DS Tarlow, M Tabachnyk, MH Rasi, J Austin, ... US Patent 11,972,234, 2024 | | 2024 |
Prompting Machine-Learned Models Using Chains of Thought JW Wei, D Zhou, DE Schuurmans, QV Le, MP Bosma, EHH Chi, ... US Patent App. 17/881,746, 2023 | | 2023 |