Suivre
Gregory Farquhar
Gregory Farquhar
DeepMind
Adresse e-mail validée de google.com
Titre
Citée par
Citée par
Année
Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson
Journal of Machine Learning Research 21 (178), 1-51, 2020
20572020
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
19942018
The starcraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
8732019
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
7032017
Weighted qmix: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, G Farquhar, B Peng, S Whiteson
Advances in neural information processing systems 33, 10199-10210, 2020
3042020
A survey of reinforcement learning informed by natural language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
2752019
Treeqn and atreec: Differentiable tree-structured models for deep reinforcement learning
G Farquhar, T Rocktäschel, M Igl, S Whiteson
arXiv preprint arXiv:1710.11417, 2017
1402017
Multi-agent common knowledge reinforcement learning
C Schroeder de Witt, J Foerster, G Farquhar, P Torr, W Boehmer, ...
Advances in neural information processing systems 32, 2019
111*2019
Dice: The infinitely differentiable monte carlo estimator
J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, E Xing, S Whiteson
International Conference on Machine Learning, 1529-1538, 2018
912018
Transient non-stationarity and generalisation in deep reinforcement learning
M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson
arXiv preprint arXiv:2006.05826, 2020
652020
Growing action spaces
G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve
International Conference on Machine Learning, 3040-3051, 2020
322020
Proper value equivalence
C Grimm, A Barreto, G Farquhar, D Silver, S Singh
Advances in Neural Information Processing Systems 34, 7773-7786, 2021
302021
The impact of non-stationarity on generalisation in deep reinforcement learning
M Igl, G Farquhar, J Luketina, W Boehmer, S Whiteson
arXiv preprint arXiv:2006.05826 8, 2020
292020
Psiphi-learning: Reinforcement learning with demonstrations using successor features and inverse temporal difference learning
A Filos, C Lyle, Y Gal, S Levine, N Jaques, G Farquhar
International Conference on Machine Learning, 3305-3317, 2021
222021
A baseline for any order gradient estimation in stochastic computation graphs
J Mao, J Foerster, T Rocktäschel, M Al-Shedivat, G Farquhar, S Whiteson
International Conference on Machine Learning, 4343-4351, 2019
122019
Counterfactual multi-agent policy gradients. CoRR abs/1705.08926 (2017)
JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
arXiv preprint arXiv:1705.08926, 2017
112017
Self-consistent models and values
G Farquhar, K Baumli, Z Marinho, A Filos, M Hessel, HP van Hasselt, ...
Advances in Neural Information Processing Systems 34, 1111-1125, 2021
102021
Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning
G Farquhar, S Whiteson, J Foerster
Advances in Neural Information Processing Systems 32, 2019
102019
Model-value inconsistency as a signal for epistemic uncertainty
A Filos, E Vértes, Z Marinho, G Farquhar, D Borsa, A Friesen, ...
arXiv preprint arXiv:2112.04153, 2021
92021
No DICE: An investigation of the bias-variance tradeoff in meta-gradients
R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson
Deep RL Workshop NeurIPS 2021, 2021
52021
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20