Følg
Eric Hambro
Eric Hambro
Anthropic
Verificeret mail på anthropic.com - Startside
Titel
Citeret af
Citeret af
År
LLaMA: Open and efficient foundation language models
H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...
arXiv preprint arXiv:2302.13971, 2023
131982023
Toolformer: Language models can teach themselves to use tools
T Schick, J Dwivedi-Yu, R Dessì, R Raileanu, M Lomeli, E Hambro, ...
Advances in Neural Information Processing Systems 36, 68539-68551, 2023
14602023
Minihack the planet: A sandbox for open-ended reinforcement learning research
M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ...
NeurIPS 2021 Datasets and Benchmarks, 2021
972021
Understanding the effects of rlhf on llm generalisation and diversity
R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ...
arXiv preprint arXiv:2310.06452, 2023
872023
Rainbow teaming: Open-ended generation of diverse adversarial prompts
M Samvelyan, SC Raparthy, A Lupu, E Hambro, A Markosyan, M Bhatt, ...
Advances in Neural Information Processing Systems 37, 69747-69786, 2025
492025
Teaching large language models to reason with reinforcement learning
A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...
arXiv preprint arXiv:2403.04642, 2024
482024
GPflux: A library for deep Gaussian processes
V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ...
arXiv preprint arXiv:2104.05674, 2021
352021
Glore: When, where, and how to improve llm reasoning via global and local refinements
A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ...
arXiv preprint arXiv:2402.10963, 2024
302024
Insights from the Neurips 2021 Nethack Challenge
E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ...
NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022
222022
LLaMA: open and efficient foundation language models. DOI: 10.48550
H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...
arXiv preprint arXiv.2302.13971 2302, 2023
202023
Dungeons and Data: A Large-Scale NetHack Dataset
E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ...
Advances in Neural Information Processing Systems 35, 24864-24878, 2022
192022
Generalization to new sequential decision making tasks with in-context learning
SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu
arXiv preprint arXiv:2312.03801, 2023
152023
moolib: A Platform for Distributed RL. 2022
V Mella, E Hambro, D Rothermel, H Küttler
URL https://github. com/facebookresearch/moolib 8, 18, 0
7*
Know when to stop: A study of semantic drift in text generation
A Spataru, E Hambro, E Voita, N Cancedda
arXiv preprint arXiv:2404.05411, 2024
32024
Learning to Solve New sequential decision-making Tasks with In-Context Learning
SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu
NeurIPS 2023 Foundation Models for Decision Making Workshop, 0
Systemet kan ikke foretage handlingen nu. Prøv igen senere.
Artikler 1–15