Matthieu Geist

Cited by

	All	Since 2019
Citations	5958	4582
h-index	40	35
i10-index	102	80

1200

600

300

900

200920102011201220132014201520162017201820192020202120222023202428 45 95 111 168 176 143 254 151 186 253 440 762 901 1179 1040

Public access

View all

13 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Bilal PiotGoogle DeepmindVerified email at google.com
Léonard HussenotGoogle DeepMindVerified email at google.com
Olivier BachemResearch Scientist, Google BrainVerified email at google.com
Mathieu LaurièreAssistant professor of Mathematics and Data Science, NYU ShanghaiVerified email at nyu.edu
Nino VieillardGoogle DeepMindVerified email at google.com
Senthilkumar ChandramohanDirector of ML EngineeringVerified email at staples.com
julien perolatDeepMindVerified email at google.com
Prof. Cédric PradalierGeorgiaTech Lorraine, UMI2958 GT-CNRS, MetzVerified email at georgiatech-metz.fr
Romuald ElieDeepmind & Université Gustave EiffelVerified email at u-pem.fr
Robert DadashiGoogle DeepMindVerified email at google.com
Anton RaichukGoogle AIVerified email at google.com
Erinc MerdivanHelmholtz AI (HMGU)Verified email at helmholtz-muenchen.de
Edouard KLEINBeaver LabsVerified email at beaver-labs.com
Johan FerretResearch Scientist, Google DeepMindVerified email at google.com
Raphaël MarinierGoogle AIVerified email at google.com
Sten HankeAssoc. Prof at FH JoanneumVerified email at fh-joanneum.at
Piotr StanczykGoogleVerified email at google.com
Johannes KropfAIT Austrian Institute of TechnologyVerified email at kropf.at
Marcin AndrychowiczGoogle BrainVerified email at openai.com

Matthieu Geist

Cohere (ex Google, on leave of Professor, Université de Lorraine)

Verified email at univ-lorraine.fr

reinforcement learning machine learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	542	2023
What matters for on-policy deep actor-critic methods? a large-scale study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... International conference on learning representations, 2020	369*	2020
A theory of regularized markov decision processes M Geist, B Scherrer, O Pietquin International Conference on Machine Learning, 2160-2169, 2019	293	2019
Human activity recognition using recurrent neural networks D Singh, E Merdivan, I Psychoula, J Kropf, S Hanke, M Geist, A Holzinger Machine Learning and Knowledge Extraction: First IFIP TC 5, WG 8.4, 8.9, 12 …, 2017	208	2017
Approximate modified policy iteration and its application to the game of Tetris. B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist J. Mach. Learn. Res. 16 (49), 1629-1676, 2015	149	2015
Kalman temporal differences M Geist, O Pietquin Journal of artificial intelligence research 39, 483-532, 2010	124	2010
Inverse reinforcement learning through structured classification E Klein, M Geist, B Piot, O Pietquin Advances in neural information processing systems 25, 2012	122	2012
Primal wasserstein imitation learning R Dadashi, L Hussenot, M Geist, O Pietquin arXiv preprint arXiv:2006.04678, 2020	121	2020
Algorithmic survey of parametric value function approximation M Geist, O Pietquin IEEE Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013	121*	2013
Sample-efficient batch reinforcement learning for dialogue management optimization O Pietquin, M Geist, S Chandramohan, H Frezza-Buet ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011	120	2011
User simulation in dialogue systems using inverse reinforcement learning S Chandramohan, M Geist, F Lefevre, O Pietquin Interspeech 2011, 1025-1028, 2011	116	2011
On the convergence of model free learning in mean field games R Elie, J Perolat, M Laurière, M Geist, O Pietquin Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7143-7150, 2020	113*	2020
IQ-Learn: Inverse soft-Q Learning for Imitation D Garg, S Chakraborty, C Cundy, J Song, M Geist, S Ermon arXiv preprint arXiv:2106.12142, 2022	111	2022
Fictitious play for mean field games: Continuous time analysis and applications S Perrin, J Pérolat, M Laurière, M Geist, R Elie, O Pietquin Advances in neural information processing systems 33, 13199-13213, 2020	109	2020
Off-policy learning with eligibility traces: a survey. M Geist, B Scherrer J. Mach. Learn. Res. 15 (1), 289-333, 2014	107	2014
Leverage the average: an analysis of kl regularization in reinforcement learning N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist Advances in Neural Information Processing Systems 33, 12163-12174, 2020	101*	2020
Bridging the gap between imitation learning and inverse reinforcement learning B Piot, M Geist, O Pietquin IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	101	2016
Convolutional and recurrent neural networks for activity recognition in smart environment D Singh, E Merdivan, S Hanke, J Kropf, M Geist, A Holzinger Towards Integrative Machine Learning and Knowledge Extraction: BIRS Workshop …, 2017	93	2017
Boosted bellman residual minimization handling expert demonstrations B Piot, M Geist, O Pietquin Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	88	2014
Munchausen reinforcement learning N Vieillard, O Pietquin, M Geist Advances in Neural Information Processing Systems 33, 4235-4246, 2020	85	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors