On First-Order Meta-Learning Algorithms A Nichol, J Achiam, J Schulman arXiv preprint arXiv:1803.02999, 2018 | 2669 | 2018 |
Constrained Policy Optimization J Achiam, D Held, A Tamar, P Abbeel International conference on machine learning, 22-31, 2017 | 1668 | 2017 |
Benchmarking Safe Exploration in Deep Reinforcement Learning A Ray, J Achiam, D Amodei arXiv preprint arXiv:1910.01708, 2019 | 477 | 2019 |
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods A Stooke, J Achiam, P Abbeel International Conference on Machine Learning, 9133-9143, 2020 | 319 | 2020 |
Spinning Up in Deep Reinforcement Learning J Achiam https://spinningup.openai.com/en/latest/, 2018 | 309* | 2018 |
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning J Achiam, S Sastry arXiv preprint arXiv:1703.01732, 2017 | 291 | 2017 |
Variational Option Discovery Algorithms J Achiam, H Edwards, D Amodei, P Abbeel arXiv preprint arXiv:1807.10299, 2018 | 212 | 2018 |
GPT-4o System Card A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ... arXiv preprint arXiv:2410.21276, 2024 | 129 | 2024 |
Towards Characterizing Divergence in Deep Q-Learning J Achiam, E Knight, P Abbeel arXiv preprint arXiv:1903.08894, 2019 | 116 | 2019 |
A Hazard Analysis Framework for Code Synthesis Large Language Models H Khlaaf, P Mishkin, J Achiam, G Krueger, M Brundage arXiv preprint arXiv:2207.14157, 2022 | 25 | 2022 |
Rule Based Rewards for Fine-Grained LLM Safety T Mu, A Helyar, J Heidecke, J Achiam, A Vallone, ID Kivlichan, M Lin, ... ICML 2024 Next Generation of AI Safety Workshop, 2024 | 12* | 2024 |
Transformer Debugger D Mossing, S Bills, H Tillman, TD la Tour, N Cammarata, L Gao, J Achiam, ... | 8 | 2024 |
Advanced Policy Gradient Methods J Achiam http://rail.eecs.berkeley.edu/deeprlcourse-fa17/f17docs …, 2017 | 6 | 2017 |
Exploration and Safety in Deep Reinforcement Learning JS Achiam University of California, Berkeley, 2021 | 5 | 2021 |
Simplified PPO-Clip Objective J Achiam https://drive.google.com/file/d/1PDzn9RPvaXjJFZkGeapMHbHGiWWW20Ey/view, 2018 | 5 | 2018 |
Training Dynamics Models for Accurate Long-Horizon Prediction E Knight, J Achiam, UC OpenAI | | |