PaLM: Scaling Language Modeling with Pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (1-113), 2023 | 5605 | 2023 |
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (1-53), 2024 | 3370 | 2024 |
Gemini: A family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 3181 | 2023 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1569 | 2023 |
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... Transactions on Machine Learning Research, https://openreview.net/pdf?id …, 2023 | 1345 | 2023 |
PaLI: A Jointly-Scaled Multilingual Language-Image Model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... The Eleventh International Conference on Learning Representations, 2023, 2023 | 677 | 2023 |
Scaling Up Models and Data with t5x and seqio A Roberts, HW Chung, A Levskaya, G Mishra, J Bradbury, D Andor, ... Journal of Machine Learning Research 24, 1-8, 2023 | 162 | 2023 |
Palm: Scaling language modeling with pathways. arXiv 2022 A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... arXiv preprint arXiv:2204.02311 10, 2022 | 123 | 2022 |
Efficient characterization of tennis shots and game analysis using wearable sensors data R Srivastava, A Patwari, S Kumar, G Mishra, L Kaligounder, P Sinha 2015 IEEE sensors, 1-4, 2015 | 29 | 2015 |
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models A Lewkowycz, A Slone, A Andreassen, D Freeman, ES Dyer, G Mishra, ... Technical report, 2022 | 5 | 2022 |
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability J Hron, L Culp, G Elsayed, R Liu, B Adlam, M Bileschi, B Bohnet, ... arXiv preprint arXiv:2408.07852, 2024 | 2 | 2024 |
Training of large neural networks S Petrov, Y Wu, AM Dai, DR So, D Lepikhin, EA Moreira, G Mishra, ... US Patent App. 18/661,447, 2024 | | 2024 |
Training of large neural networks S Petrov, Y Wu, AM Dai, DR So, D Lepikhin, EA Moreira, G Mishra, ... US Patent App. 18/661,499, 2024 | | 2024 |
Frontier Language Models are not Robust to Adversarial Arithmetic, or" What do I need to say so you agree 2+ 2= 5? CD Freeman, L Culp, A Parisi, ML Bileschi, GF Elsayed, A Rizkowsky, ... arXiv preprint arXiv:2311.07587, 2023 | | 2023 |
Deterministic training of machine learning models G Mishra, AJ Roberts, NM Shazeer, MP Bosma US Patent App. 18/219,555, 2023 | | 2023 |
Deterministic training of machine learning models G Mishra, AJ Roberts, NM Shazeer, MP Bosma US Patent App. 18/130,339, 2023 | | 2023 |
Deterministic training of machine learning models G Mishra, A Roberts, N Shazeer, M Bosma US Patent US20230351190A1, 2023 | | 2023 |
Method and Device for Generating Data Representing Structure of Room G Mishra, A Patwari, R Srivastava, A De, D Patkar US Patent US20150347638A1, 2015 | | 2015 |
Incorporation of Particle Swarm Optimization in Adaptive Boosting G Mishra, R Kumar, S Chaudhury Pattern Recognition and Machine Intelligence: 5th International Conference …, 2013 | | 2013 |