Suivre
Joshua Achiam
Joshua Achiam
Research Scientist, OpenAI
Adresse e-mail validée de openai.com - Page d'accueil
Titre
Citée par
Citée par
Année
On First-Order Meta-Learning Algorithms
A Nichol, J Achiam, J Schulman
arXiv preprint arXiv:1803.02999, 2018
26692018
Constrained Policy Optimization
J Achiam, D Held, A Tamar, P Abbeel
International conference on machine learning, 22-31, 2017
16682017
Benchmarking Safe Exploration in Deep Reinforcement Learning
A Ray, J Achiam, D Amodei
arXiv preprint arXiv:1910.01708, 2019
4772019
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
A Stooke, J Achiam, P Abbeel
International Conference on Machine Learning, 9133-9143, 2020
3192020
Spinning Up in Deep Reinforcement Learning
J Achiam
https://spinningup.openai.com/en/latest/, 2018
309*2018
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
J Achiam, S Sastry
arXiv preprint arXiv:1703.01732, 2017
2912017
Variational Option Discovery Algorithms
J Achiam, H Edwards, D Amodei, P Abbeel
arXiv preprint arXiv:1807.10299, 2018
2122018
Gpt-4o system card
A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ...
arXiv preprint arXiv:2410.21276, 2024
1292024
Towards Characterizing Divergence in Deep Q-Learning
J Achiam, E Knight, P Abbeel
arXiv preprint arXiv:1903.08894, 2019
1162019
A Hazard Analysis Framework for Code Synthesis Large Language Models
H Khlaaf, P Mishkin, J Achiam, G Krueger, M Brundage
arXiv preprint arXiv:2207.14157, 2022
252022
Rule based rewards for fine-grained llm safety
T Mu, A Helyar, J Heidecke, J Achiam, A Vallone, ID Kivlichan, M Lin, ...
ICML 2024 Next Generation of AI Safety Workshop, 2024
12*2024
Transformer Debugger
D Mossing, S Bills, H Tillman, TD la Tour, N Cammarata, L Gao, J Achiam, ...
82024
Advanced Policy Gradient Methods
J Achiam
http://rail.eecs.berkeley.edu/deeprlcourse-fa17/f17docs …, 2017
62017
Exploration and Safety in Deep Reinforcement Learning
JS Achiam
University of California, Berkeley, 2021
52021
Simplified PPO-Clip Objective
J Achiam
https://drive.google.com/file/d/1PDzn9RPvaXjJFZkGeapMHbHGiWWW20Ey/view, 2018
52018
Training Dynamics Models for Accurate Long-Horizon Prediction
E Knight, J Achiam, UC OpenAI
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–16