Eric Hambro

Cited by

	All	Since 2020
Citations	15090	15075
h-index	12	12
i10-index	12	12

Citations per year

Year	Citations
2022	95
2023	3598
2024	10052
2025	1271

Public access

1 article available, 0 articles not available (based on funding mandates)

Co-authors

Heinrich Küttler: xAI. Verified email at math.lmu.de
Tim Rocktäschel: Director and Open-Endedness Team Lead at Google DeepMind, Professor of AI at UCL, Fellow ELLIS. Verified email at cs.ucl.ac.uk
Mikayel Samvelyan: Google DeepMind. Verified email at google.com
Roberta Raileanu: Research Scientist at Meta, Honorary Lecturer at UCL
Sharath Chandra Raparthy: Member of Technical Staff at Reka AI

Eric Hambro

Anthropic

Verified email at anthropic.com

Interests: Machine Learning, Reinforcement Learning, Natural Language Processing


Title	Authors	Venue	Cited by	Year
LLaMA: Open and efficient foundation language models	H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...	arXiv preprint arXiv:2302.13971, 2023	13198	2023
Toolformer: Language models can teach themselves to use tools	T Schick, J Dwivedi-Yu, R Dessì, R Raileanu, M Lomeli, E Hambro, ...	Advances in Neural Information Processing Systems 36, 68539-68551, 2023	1460	2023
MiniHack the planet: A sandbox for open-ended reinforcement learning research	M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ...	NeurIPS 2021 Datasets and Benchmarks, 2021	97	2021
Understanding the effects of RLHF on LLM generalisation and diversity	R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ...	arXiv preprint arXiv:2310.06452, 2023	87	2023
Rainbow teaming: Open-ended generation of diverse adversarial prompts	M Samvelyan, SC Raparthy, A Lupu, E Hambro, A Markosyan, M Bhatt, ...	Advances in Neural Information Processing Systems 37, 69747-69786, 2025	49	2025
Teaching large language models to reason with reinforcement learning	A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...	arXiv preprint arXiv:2403.04642, 2024	48	2024
GPflux: A library for deep Gaussian processes	V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ...	arXiv preprint arXiv:2104.05674, 2021	35	2021
GLoRe: When, where, and how to improve LLM reasoning via global and local refinements	A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ...	arXiv preprint arXiv:2402.10963, 2024	30	2024
Insights from the NeurIPS 2021 NetHack Challenge	E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ...	NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	22	2022
LLaMA: open and efficient foundation language models. DOI: 10.48550	H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...	arXiv preprint arXiv.2302.13971 2302, 2023	20	2023
Dungeons and Data: A Large-Scale NetHack Dataset	E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ...	Advances in Neural Information Processing Systems 35, 24864-24878, 2022	19	2022
Generalization to new sequential decision making tasks with in-context learning	SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu	arXiv preprint arXiv:2312.03801, 2023	15	2023
moolib: A Platform for Distributed RL. 2022	V Mella, E Hambro, D Rothermel, H Küttler	URL https://github.com/facebookresearch/moolib 8, 18, 0	7*	
Know when to stop: A study of semantic drift in text generation	A Spataru, E Hambro, E Voita, N Cancedda	arXiv preprint arXiv:2404.05411, 2024	3	2024
Learning to Solve New sequential decision-making Tasks with In-Context Learning	SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu	NeurIPS 2023 Foundation Models for Decision Making Workshop, 0		

Articles 1–15