Joel Z Leibo

Citée par

	Toutes	Depuis 2019
Citations	12863	10917
indice h	40	35
indice i10	64	54

2800

1400

700

2100

20132014201520162017201820192020202120222023202468 84 91 153 435 901 1281 1752 2131 2292 2732 713

Accès public

Tout afficher

10 articles

2 articles

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLAdresse e-mail validée de ucl.ac.uk
TOMASO POGGIOMcDermott Professor in Brain Sciences, MITAdresse e-mail validée de ai.mit.edu
Edward HughesStaff Research Engineer, DeepMindAdresse e-mail validée de google.com
Marc LanctotResearch Scientist, Google DeepMindAdresse e-mail validée de google.com
Edgar A. Duéñez-GuzmánGoogle DeepMindAdresse e-mail validée de oeb.harvard.edu
Karl TuylsResearch Scientist, Google DeepMind and Professor of computer science, University of LiverpoolAdresse e-mail validée de google.com
Wojciech Marian Czarnecki.Adresse e-mail validée de google.com
Matthew BotvinickGoogle DeepMind, Yale Law School, University College LondonAdresse e-mail validée de google.com
Charlie BeattieSoftware Engineer, DeepMindAdresse e-mail validée de google.com
Peter SunehagGoogle - DeepMindAdresse e-mail validée de google.com
Tom SchaulSenior Staff Scientist, DeepMindAdresse e-mail validée de nyu.edu
Kevin R. McKeeStaff Research Scientist, Google DeepMindAdresse e-mail validée de deepmind.com
Audrūnas GruslysAdresse e-mail validée de gruslys.com
Raphael KösterGoogle DeepMindAdresse e-mail validée de google.com
Jane X. WangStaff Research Scientist, DeepMindAdresse e-mail validée de google.com
Max JaderbergChief AI Officer, Isomorphic LabsAdresse e-mail validée de robots.ox.ac.uk
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliateAdresse e-mail validée de units.it
Vinicius ZambaldiGoogle DeepmindAdresse e-mail validée de google.com
Dharshan KumaranGoogle DeepMindAdresse e-mail validée de fil.ion.ucl.ac.uk
Lorenzo RosascoMaLGa Machine Learning Genoa Center - Università degli Studi di GenovaAdresse e-mail validée de unige.it

Suivre

Joel Z Leibo

Research scientist

Adresse e-mail validée de google.com - Page d'accueil

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1566	2017
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1347	2016
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1322*	2018
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1021	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	905	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	892	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	603	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	584	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	497*	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	287*	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	272	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	257	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	233	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	209	2017
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	194	2018
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	184	2020
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	177	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	166	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	140	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	131	2018

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs