Gregory Farquhar

Citado por

	Todos	Desde 2019
Citações	6912	6671
Índice h	14	14
Índice i10	18	18

2100

1050

525

1575

2017201820192020202120222023202448 154 436 767 1184 1643 2048 588

Acesso público

Ver tudo

13 artigos

0 artigos

disponível

não disponível

Com base em autorizações de financiamento

Coautores

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoEmail confirmado em cs.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordEmail confirmado em eng.ox.ac.uk
Nantas NardelliCarbon ReEmail confirmado em carbonre.com
Philip TorrProfessor, University of OxfordEmail confirmado em eng.ox.ac.uk
Triantafyllos AfourasMeta AI (FAIR), University of OxfordEmail confirmado em fb.com
Tim RocktäschelProfessor of Artificial Intelligence at UCL, Open-Endedness Team Lead at Google DeepMindEmail confirmado em cs.ucl.ac.uk
Pushmeet KohliDeepMindEmail confirmado em google.com

Seguir

Gregory Farquhar

DeepMind

Email confirmado em google.com

Reinforcement Learning Artificial Intelligence


Título Ordenar por citações Ordenar por ano Ordenar por título	Citado por Citado por	Ano
Monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21 (178), 1-51, 2020	2105	2020
Counterfactual multi-agent policy gradients J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2021	2018
The starcraft multi-agent challenge M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ... arXiv preprint arXiv:1902.04043, 2019	898	2019
Stabilising experience replay for deep multi-agent reinforcement learning J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ... International conference on machine learning, 1146-1155, 2017	706	2017
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, G Farquhar, B Peng, S Whiteson Advances in neural information processing systems 33, 10199-10210, 2020	312	2020
A survey of reinforcement learning informed by natural language J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ... arXiv preprint arXiv:1906.03926, 2019	281	2019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning G Farquhar, T Rocktäschel, M Igl, S Whiteson arXiv preprint arXiv:1710.11417, 2017	140	2017
Multi-agent common knowledge reinforcement learning C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ... Advances in neural information processing systems 32, 2019	111*	2019
Dice: The infinitely differentiable monte carlo estimator J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson International Conference on Machine Learning, 1529-1538, 2018	92	2018
Transient non-stationarity and generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826, 2020	65	2020
Growing action spaces G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve International Conference on Machine Learning, 3040-3051, 2020	33	2020
Proper value equivalence C Grimm, A Barreto, G Farquhar, D Silver, S Singh Advances in Neural Information Processing Systems 34, 7773-7786, 2021	31	2021
The impact of non-stationarity on generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826 8, 2020	29	2020
Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar International Conference on Machine Learning, 3305-3317, 2021	25	2021
A baseline for any order gradient estimation in stochastic computation graphs J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson International Conference on Machine Learning, 4343-4351, 2019	12	2019
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017) JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson arXiv preprint arXiv:1705.08926, 2017	11	2017
Self-consistent models and values G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 1111-1125, 2021	10	2021
Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning G Farquhar, S Whiteson, J Foerster Advances in Neural Information Processing Systems 32, 2019	10	2019
Model-value inconsistency as a signal for epistemic uncertainty A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ... arXiv preprint arXiv:2112.04153, 2021	9	2021
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021	5	2021

O sistema não pode efectuar a operação agora. Tente mais tarde.

Artigos 1–20

Citações por ano

Citações duplicadas

Citações unidas

Adicionar coautoresCoautores

Seguir

Citado por

Coautores