Language models are few-shot learners T Brown, B Mann, N Ryder, M Subbiah, JD Kaplan, P Dhariwal, ... Advances in neural information processing systems 33, 1877-1901, 2020 | 40569 | 2020 |
GPT-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 7932 | 2023 |
Hierarchical text-conditional image generation with CLIP latents A Ramesh, P Dhariwal, A Nichol, C Chu, M Chen arXiv preprint arXiv:2204.06125 1 (2), 3, 2022 | 6850 | 2022 |
Zero-shot text-to-image generation A Ramesh, M Pavlov, G Goh, S Gray, C Voss, A Radford, M Chen, ... International conference on machine learning, 8821-8831, 2021 | 5552 | 2021 |
Evaluating large language models trained on code M Chen, J Tworek, H Jun, Q Yuan, HPDO Pinto, J Kaplan, H Edwards, ... arXiv preprint arXiv:2107.03374, 2021 | 4131* | 2021 |
GLIDE: Towards photorealistic image generation and editing with text-guided diffusion models A Nichol, P Dhariwal, A Ramesh, P Shyam, P Mishkin, B McGrew, ... arXiv preprint arXiv:2112.10741, 2021 | 3478 | 2021 |
Training verifiers to solve math word problems K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ... arXiv preprint arXiv:2110.14168, 2021 | 2633 | 2021 |
Generative pretraining from pixels M Chen, A Radford, R Child, J Wu, H Jun, D Luan, I Sutskever International conference on machine learning, 1691-1703, 2020 | 1877 | 2020 |
Consistency models Y Song, P Dhariwal, M Chen, I Sutskever arXiv preprint arXiv:2303.01469, 2023 | 809 | 2023 |
Point-E: A system for generating 3D point clouds from complex prompts A Nichol, H Jun, P Dhariwal, P Mishkin, M Chen arXiv preprint arXiv:2212.08751, 2022 | 506 | 2022 |
Scaling laws for autoregressive generative modeling T Henighan, J Kaplan, M Katz, M Chen, C Hesse, J Jackson, H Jun, ... arXiv preprint arXiv:2010.14701, 2020 | 383 | 2020 |
Training verifiers to solve math word problems, 2021 K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ... URL https://arxiv.org/abs/2110.14168, 2021 | 167 | 2021 |
Efficient training of language models to fill in the middle M Bavarian, H Jun, N Tezak, J Schulman, C McLeavey, J Tworek, M Chen arXiv preprint arXiv:2207.14255, 2022 | 145 | 2022 |
GPT-4o system card A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ... arXiv preprint arXiv:2410.21276, 2024 | 132 | 2024 |
DALL·E: Creating images from text A Ramesh, M Pavlov, G Goh, S Gray, M Chen, R Child, V Misra, P Mishkin, ... OpenAI blog 2, 2021 | 108 | 2021 |
Hierarchical text-conditional image generation with CLIP latents. arXiv preprint arXiv:2204.06125 A Ramesh, P Dhariwal, A Nichol, C Chu, M Chen | 72 | 2022 |
Language models are few-shot learners TB Brown, B Mann, N Ryder, M Subbiah, J Kaplan, P Dhariwal, ... arXiv preprint arXiv:2005.14165, 2020 | 65 | 2020 |
Zero-shot text-to-image generation, 2021 A Ramesh, M Pavlov, G Goh, S Gray, C Voss, A Radford, M Chen, ... URL https://arxiv.org/abs/2102.12092 1, 2021 | 63 | 2021 |
Distribution augmentation for generative modeling H Jun, R Child, M Chen, J Schulman, A Ramesh, A Radford, I Sutskever International Conference on Machine Learning, 5006-5019, 2020 | 63 | 2020 |
Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, David W M Chen, J Tworek, H Jun, Q Yuan, H Ponde, J Kaplan, H Edwards, ... | 56 | 2021 |