LLaMA: Open and efficient foundation language models H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ... arXiv preprint arXiv:2302.13971, 2023 | 2399 | 2023 |
MiniHack the planet: A sandbox for open-ended reinforcement learning research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... NeurIPS 2021 Datasets and Benchmarks, 2021 | 61 | 2021 |
GPflux: A library for deep Gaussian processes V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ... arXiv preprint arXiv:2104.05674, 2021 | 23 | 2021 |
Insights from the NeurIPS 2021 NetHack Challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022 | 12 | 2022 |
LLaMA: Open and Efficient Foundation Language Models H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ... arXiv preprint arXiv:2302.13971, DOI: 10.48550/ARXIV.2302.13971, 2023 | 8 | 2023 |
Dungeons and Data: A Large-Scale NetHack Dataset E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ... Advances in Neural Information Processing Systems 35, 24864-24878, 2022 | 4 | 2022 |
moolib: A Platform for Distributed RL V Mella, E Hambro, D Rothermel, H Küttler URL https://github.com/facebookresearch/moolib, 2022 | 4 | 2022 |
Understanding the Effects of RLHF on LLM Generalisation and Diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... arXiv preprint arXiv:2310.06452, 2023 | 1 | 2023 |
Learning to Solve New Sequential Decision-Making Tasks with In-Context Learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | | 2023 |