Eric Hambro - Google Scholar

Cited by

	All	Since 2019
Citations	6289	6284
h-index	7	7
i10-index	7	7

Citations per year
Year	2022	2023	2024
Citations	68	3743	2437

Co-authors

Roberta Raileanu	Research Scientist, Meta	Verified email at fb.com
Heinrich Küttler	Inflection AI	Verified email at math.lmu.de
Tim Rocktäschel	Professor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMind	Verified email at cs.ucl.ac.uk
Mikayel Samvelyan	Meta AI & UCL	Verified email at meta.com
Sharath Chandra Raparthy	Meta AI	Verified email at mila.quebec

Eric Hambro

Anthropic

Verified email at anthropic.com

Machine Learning, Reinforcement Learning, Natural Language Processing


Title	Cited by	Year
LLaMA: Open and efficient foundation language models. H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ... arXiv preprint arXiv:2302.13971, 2023	5403	2023
Toolformer: Language models can teach themselves to use tools. T Schick, J Dwivedi-Yu, R Dessì, R Raileanu, M Lomeli, E Hambro, ... Advances in Neural Information Processing Systems 36, 2024	736	2024
MiniHack the planet: A sandbox for open-ended reinforcement learning research. M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... NeurIPS 2021 Datasets and Benchmarks, 2021	69	2021
GPflux: a library for deep Gaussian processes. V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ... arXiv preprint arXiv:2104.05674, 2021	25	2021
Insights from the NeurIPS 2021 NetHack Challenge. E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	17	2022
Understanding the effects of RLHF on LLM generalisation and diversity. R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... arXiv preprint arXiv:2310.06452, 2023	16	2023
Dungeons and Data: A Large-Scale NetHack Dataset. E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ... Advances in Neural Information Processing Systems 35, 24864-24878, 2022	10	2022
moolib: A Platform for Distributed RL. 2022. V Mella, E Hambro, D Rothermel, H Küttler. URL https://github.com/facebookresearch/moolib 8, 18, 2022	7*	2022
Generalization to new sequential decision making tasks with in-context learning. SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu. arXiv preprint arXiv:2312.03801, 2023	4	2023
Teaching Large Language Models to Reason with Reinforcement Learning. A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024	1	2024
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements. A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ... arXiv preprint arXiv:2402.10963, 2024	1	2024
Know When To Stop: A Study of Semantic Drift in Text Generation. A Spataru, E Hambro, E Voita, N Cancedda. arXiv preprint arXiv:2404.05411, 2024		2024
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts. M Samvelyan, SC Raparthy, A Lupu, E Hambro, AH Markosyan, M Bhatt, ... arXiv preprint arXiv:2402.16822, 2024		2024
Learning to Solve New sequential decision-making Tasks with In-Context Learning. SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu. NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023		2023
