Personality traits in large language models | M Safdari, G Serapio-García, C Crepy, S Fitz, P Romero, L Sun, ... | arXiv preprint arXiv:2307.00184, 2023 | 167 | 2023 |
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning | DK Kim, M Liu, M Riemer, C Sun, M Abdulhai, G Habibi, S Lopez-Cot, ... | arXiv preprint arXiv:2011.00382, 2020 | 77 | 2020 |
Moral Foundations of Large Language Models | M Abdulhai, C Crepy, D Valter, J Canny, N Jaques | | 60 | 2022 |
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models | M Abdulhai, I White, C Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine | arXiv preprint arXiv:2311.18232, 2023 | 20 | 2023 |
Context-Specific Representation Abstraction for Deep Option Learning | M Abdulhai, DK Kim, M Riemer, M Liu, G Tesauro, JP How | arXiv preprint arXiv:2109.09876, 2021 | 15 | 2021 |
Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience | M Abdulhai, N Jaques, S Levine | arXiv preprint arXiv:2208.04919, 2022 | 5 | 2022 |