Gregory Farquhar

Cited by

	All	Since 2020
Citations	9067	8379
h-index	14	14
i10-index	20	19

2200

1100

550

1650

20172018201920202021202220232024202548 157 438 778 1173 1648 2128 2125 521

Public access

View all

13 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoVerified email at cs.ox.ac.uk
Jakob FoersterAssociate Professor, University of OxfordVerified email at eng.ox.ac.uk
Nantas NardelliPacific FusionVerified email at pacificfusion.com
Philip TorrProfessor, University of OxfordVerified email at eng.ox.ac.uk
Triantafyllos AfourasFAIR, Meta, University of OxfordVerified email at fb.com
Tim RocktäschelDirector and Open-Endedness Team Lead at Google DeepMind, Professor of AI at UCL, Fellow ELLISVerified email at cs.ucl.ac.uk
Pushmeet KohliDeepMindVerified email at google.com

Gregory Farquhar

DeepMind

Verified email at google.com

Reinforcement Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson Journal of Machine Learning Research 21 (178), 1-51, 2020	2952	2020
Counterfactual multi-agent policy gradients J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2563	2018
The starcraft multi-agent challenge M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ... arXiv preprint arXiv:1902.04043, 2019	1249	2019
Stabilising experience replay for deep multi-agent reinforcement learning J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ... International conference on machine learning, 1146-1155, 2017	803	2017
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning T Rashid, G Farquhar, B Peng, S Whiteson Advances in neural information processing systems 33, 10199-10210, 2020	426	2020
A survey of reinforcement learning informed by natural language J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ... arXiv preprint arXiv:1906.03926, 2019	334	2019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning G Farquhar, T Rocktäschel, M Igl, S Whiteson arXiv preprint arXiv:1710.11417, 2017	161	2017
Multi-agent common knowledge reinforcement learning C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ... Advances in neural information processing systems 32, 2019	131*	2019
Dice: The infinitely differentiable monte carlo estimator J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson International Conference on Machine Learning, 1529-1538, 2018	110	2018
Transient non-stationarity and generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826, 2020	98	2020
Growing action spaces G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve International Conference on Machine Learning, 3040-3051, 2020	42	2020
Proper value equivalence C Grimm, A Barreto, G Farquhar, D Silver, S Singh Advances in neural information processing systems 34, 7773-7786, 2021	40	2021
Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar International Conference on Machine Learning, 3305-3317, 2021	39	2021
The impact of non-stationarity on generalisation in deep reinforcement learning M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson arXiv preprint arXiv:2006.05826 8, 2020	37	2020
Self-consistent models and values G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 1111-1125, 2021	13	2021
A baseline for any order gradient estimation in stochastic computation graphs J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson International Conference on Machine Learning, 4343-4351, 2019	13	2019
Discovering general reinforcement learning algorithms with adversarial environment design MT Jackson, M Jiang, J Parker-Holder, R Vuorio, C Lu, G Farquhar, ... Advances in Neural Information Processing Systems 36, 79980-79998, 2023	12	2023
Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning G Farquhar, S Whiteson, J Foerster Advances in Neural Information Processing Systems 32, 2019	12	2019
Model-value inconsistency as a signal for epistemic uncertainty A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ... arXiv preprint arXiv:2112.04153, 2021	11	2021
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017) JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson arXiv preprint arXiv:1705.08926, 2017	11	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors