Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 9438* | 2023 |
Flamingo: a visual language model for few-shot learning JB Alayrac, J Donahue, P Luc, A Miech, I Barr, Y Hasson, K Lenc, ... Advances in neural information processing systems 35, 23716-23736, 2022 | 3796 | 2022 |
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021 | 1214 | 2021 |
Improving language models by retrieving from trillions of tokens S Borgeaud, A Mensch, J Hoffmann, T Cai, E Rutherford, K Millican, ... International conference on machine learning, 2206-2240, 2022 | 1104 | 2022 |
Multimodal few-shot learning with frozen language models M Tsimpoukelli, JL Menick, S Cabi, SM Eslami, O Vinyals, F Hill Advances in Neural Information Processing Systems 34, 200-212, 2021 | 778 | 2021 |
Automated curriculum learning for neural networks A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu international conference on machine learning, 1311-1320, 2017 | 660 | 2017 |
Rigging the lottery: Making all tickets winners U Evci, T Gale, J Menick, PS Castro, E Elsen International conference on machine learning, 2943-2952, 2020 | 652 | 2020 |
ChatGPT: Optimizing language models for dialogue J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... OpenAI blog 2 (4), 2022 | 374* | 2022 |
Teaching language models to support answers with verified quotes J Menick, M Trebacz, V Mikulik, J Aslanides, F Song, M Chadwick, ... arXiv preprint arXiv:2203.11147, 2022 | 215 | 2022 |
Generating images with sparse representations C Nash, J Menick, S Dieleman, PW Battaglia arXiv preprint arXiv:2103.03841, 2021 | 198 | 2021 |
Generating high fidelity images with subscale pixel networks and multidimensional upscaling J Menick, N Kalchbrenner arXiv preprint arXiv:1812.01608, 2018 | 159 | 2018 |
Multiplicative interactions and where to find them SM Jayakumar, WM Czarnecki, J Menick, J Schwarz, J Rae, S Osindero, ... International conference on learning representations, 2020 | 143 | 2020 |
Gpt-4o system card A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ... arXiv preprint arXiv:2410.21276, 2024 | 132 | 2024 |
A practical sparse approximation for real time recurrent learning J Menick, E Elsen, U Evci, S Osindero, K Simonyan, A Graves arXiv preprint arXiv:2006.07232, 2020 | 59* | 2020 |
Noisy networks for exploration. arXiv 2017 M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295, 0 | 45 | |
Data compression using jointly trained encoder, decoder, and prior neural networks JL Menick, AB Graves US Patent App. 16/767,010, 2021 | 26 | 2021 |
Reduced computation real time recurrent learning JL Menick, EK Elsen, K Simonyan US Patent App. 17/169,083, 2021 | 22 | 2021 |
Associative compression networks for representation learning A Graves, J Menick, A Oord arXiv preprint arXiv:1804.02476, 2018 | 21 | 2018 |
Alethea Power, Stanislas Polu, Jesse Han, Raul Puri, Shawn Jain J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, JFC Uribe, ... Benjamin Chess, Christian Gibson, Oleg Boiko, Emy Parparita, Amin …, 2022 | 16 | 2022 |
Introducing ChatGPT. OpenAI J Schulman, B Zoph, C Kim, J Hilton, J Menick, J Weng, C Hesse | 7 | 2022 |