Training language models to follow instructions with human feedback L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ... Advances in neural information processing systems 35, 27730-27744, 2022 | 11834 | 2022 |
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 8077 | 2023 |
Learning to summarize with human feedback N Stiennon, L Ouyang, J Wu, D Ziegler, R Lowe, C Voss, A Radford, ... Advances in Neural Information Processing Systems 33, 3008-3021, 2020 | 1890 | 2020 |
Webgpt: Browser-assisted question-answering with human feedback R Nakano, J Hilton, S Balaji, J Wu, L Ouyang, C Kim, C Hesse, S Jain, ... arXiv preprint arXiv:2112.09332, 2021 | 1129 | 2021 |
Improving image generation with better captions J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ... Computer Science. https://cdn. openai. com/papers/dall-e-3. pdf 2 (3), 8, 2023 | 812 | 2023 |
Training language models to follow instructions with human feedback, 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... URL https://arxiv. org/abs/2203.02155 13, 1, 2022 | 302 | 2022 |
Recursively summarizing books with human feedback J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano arXiv preprint arXiv:2109.10862, 2021 | 266 | 2021 |
Self-critiquing models for assisting human evaluators W Saunders, C Yeh, J Wu, S Bills, L Ouyang, J Ward, J Leike arXiv preprint arXiv:2206.05802, 2022 | 230 | 2022 |
Gpt-4o system card A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ... arXiv preprint arXiv:2410.21276, 2024 | 144 | 2024 |
Training language models to follow instructions with human feedback. arXiv L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... arXiv preprint arXiv:2203.02155, 2022 | 134 | 2022 |
Training language models to follow instructions with human feedback, March 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... URL http://arxiv. org/abs/2203.02155 92, 0 | 58 | |
Training language models to follow instructions with human feedback. arXiv 2022 L Ouyang, J Wu, X Jiang, D Almeida, CL Wainwright, P Mishkin, C Zhang, ... arXiv preprint arXiv:2203.02155 10, 2022 | 51 | 2022 |
Improving image generation with better captions. 2023 J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ... URL https://cdn. openai. com/papers/dall-e-3. pdf, 2023 | 43 | 2023 |
Learning to summarize from human feedback, 2020 N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ... URL https://arxiv. org/abs, 2009 | 24 | 2009 |
Practical optimal experiment design with probabilistic programs L Ouyang, MH Tessler, D Ly, N Goodman arXiv preprint arXiv:1608.05046, 2016 | 22 | 2016 |
Semantic coherence facilitates distributional learning L Ouyang, L Boroditsky, MC Frank Cognitive science 41, 855-884, 2017 | 20 | 2017 |
Recursively summarizing books with human feedback, 2021 J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano URL https://arxiv. org/abs/2109.10862, 0 | 11 | |
ja Ramesh, A., 2023 J Betker, G Goh, L Jing, T Brooks, J Wang, L Li, L Ouyang, J Zhuang, ... Improving Image Generation with Better Captions, 0 | 9 | |
Fabular: Regression formulas as probabilistic programming J Borgström, AD Gordon, L Ouyang, C Russo, A Ścibior, M Szymczak Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of …, 2016 | 8 | 2016 |
webppl-oed: A practical optimal experiment design system. L Ouyang, MH Tessler, D Ly, ND Goodman CogSci, 2018 | 7 | 2018 |