Bilal Piot

Cited by

	All	Since 2019
Citations	15091	14243
h-index	36	33
i10-index	46	44

4500

2250

1125

3375

2014201520162017201820192020202120222023202448 43 91 128 467 842 1325 2454 3657 4463 1490

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Mohammad Gheshlaghi AzarCohereVerified email at google.com
Zhaohan Daniel GuoDeepMindVerified email at google.com
Rémi MunosDeepMindVerified email at inria.fr
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Florent AltchéResearch Engineer, DeepMindVerified email at google.com
Jean-bastien GrillVerified email at google.com
Florian STRUBDeepMindVerified email at google.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Corentin TallecDeepMindVerified email at google.com
Pierre RichemondGoogle DeepMindVerified email at deepmind.com
Charles BlundellResearch Scientist at DeepMindVerified email at google.com
Todd HesterWaymoVerified email at waymo.com
Pablo SprechmannResearch Scientist at Google DeepMindVerified email at google.com
Steven KapturowskiDeepMindVerified email at google.com
Mel VecerikDeepMind, University College LondonVerified email at ucl.ac.uk
Dan HorganGoogle DeepMindVerified email at google.com
Adrià Puigdomènech BadiaDeepMindVerified email at google.com
Alex VitvitskyiDeepMindVerified email at google.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLVerified email at google.com

Bilal Piot

Google Deepmind

Verified email at google.com

reinforcement learning inverse reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... arXiv preprint arXiv:2006.07733, 2020	5867	2020
Rainbow: Combining improvements in deep reinforcement learning M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2494	2018
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1166	2018
Noisy Networks for Exploration M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295 2018, 2017	1124*	2017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ... arXiv preprint arXiv:1707.08817, 2017	744	2017
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020	598	2020
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	318	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	231	2020
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017	175	2017
Mastering the game of stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	142	2022
Bootstrap latent-predictive representations for multitask reinforcement learning ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar International Conference on Machine Learning, 3875-3886, 2020	138	2020
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018	129	2018
Inverse reinforcement learning through structured classification E Klein, M Geist, B Piot, O Pietquin Advances in neural information processing systems 25, 2012	119	2012
Approximate dynamic programming for two-player zero-sum markov games J Perolat, B Scherrer, B Piot, O Pietquin International Conference on Machine Learning, 1321-1329, 2015	113	2015
Bridging the gap between imitation learning and inverse reinforcement learning B Piot, M Geist, O Pietquin IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	100	2016
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos arXiv preprint arXiv:1704.04651, 2017	97	2017
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	89	2019
Boosted bellman residual minimization handling expert demonstrations B Piot, M Geist, O Pietquin Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	87	2014
Byol works even without batch statistics PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... arXiv preprint arXiv:2010.10241, 2020	85	2020
End-to-end optimization of goal-driven and visually grounded dialogue systems F Strub, H De Vries, J Mary, B Piot, A Courville, O Pietquin arXiv preprint arXiv:1703.05423, 2017	84	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors