Ahmed Touati

Citée par

	Toutes	Depuis 2019
Citations	447	418
indice h	12	12
indice i10	14	14

100

2016201720182019202020212022202320244 8 10 42 61 82 100 100 33

Accès public

Tout afficher

4 articles

0 article

disponibles

non disponibles

Sur la base des exigences liées au financement

Coauteurs

Pascal VincentFacebook AI Research; U. Montreal (Professor, Computer Sc. & Op. Res.); MILA; CIFARAdresse e-mail validée de iro.umontreal.ca
Joshua RomoffUbisoft La ForgeAdresse e-mail validée de ubisoft.com
Pierre-Luc BaconUniversity of MontrealAdresse e-mail validée de mila.quebec
Nan Rosemary KeGoogle, Deepmind, MilaAdresse e-mail validée de google.com
Simon Lacoste-JulienAssociate Professor - Canada CIFAR AI Chair, University of Montreal / MilaAdresse e-mail validée de iro.umontreal.ca
Doina PrecupDeepMind and McGill UniversityAdresse e-mail validée de cs.mcgill.ca
Chin-Wei HuangMicrosoft ResearchAdresse e-mail validée de microsoft.com
Aaron CourvilleProfessor, DIRO, Université de Montréal, Mila, Cifar CAI chairAdresse e-mail validée de umontreal.ca
Gabriel HuangPhD candidate, Mila & Visiting Researcher, ServiceNowAdresse e-mail validée de umontreal.ca
Gauthier GidelAssistant professor at Mila, University of Montréal (DIRO)Adresse e-mail validée de umontreal.ca
Harsh SatijaMcGill University, MilaAdresse e-mail validée de mail.mcgill.ca
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at AustinAdresse e-mail validée de austin.utexas.edu
Laurent DinhAppleAdresse e-mail validée de apple.com
Jerome Le NyProfessor of Electrical Engineering, Polytechnique Montreal, and GERADAdresse e-mail validée de polymtl.ca
Marc G. BellemareGoogle BrainAdresse e-mail validée de google.com
Adrien Ali TaïgaUniversité de MontréalAdresse e-mail validée de umontreal.ca

Suivre

Ahmed Touati

Meta AI

Adresse e-mail validée de umontreal.ca

Machine learning Reinforcement learning


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future NR Ke, A Singh, A Touati, A Goyal, Y Bengio, D Parikh, D Batra ICLR 2019 - Proceedings of the Seventh International Conference on Learning …, 2019	81*	2019
Learning One Representation to Optimize All Rewards A Touati, Y Ollivier NeurIPS 2021: Thirty-fifth Conference on Neural Information Processing Systems, 2021	49	2021
Convergent Tree Backup and Retrace with Function Approximation A Touati, PL Bacon, D Precup, P Vincent ICML 2018, Proceedings of the 35th International Conference on Machine …, 2017	46	2017
Randomized value functions via multiplicative normalizing flows A Touati, H Satija, J Romoff, J Pineau, P Vincent UAI2019: Conference on Uncertainty in Artificial Intelligence 2019, 2018	38	2018
Efficient learning in non-stationary linear markov decision processes A Touati, P Vincent arXiv preprint arXiv:2010.12870, 2020	33	2020
Learnable explicit density for continuous latent space and variational inference CW Huang, A Touati, L Dinh, M Drozdzal, M Havaei, L Charlin, ... ICML 2017 Workshop on Principle Approaches to Deep Learning (padl), 2017	30	2017
Real-time privacy-preserving model-based estimation of traffic flows J Le Ny, A Touati, GJ Pappas 2014 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), 92-102, 2014	29	2014
Separable value functions across time-scales J Romoff, P Henderson, A Touati, Y Ollivier, J Pineau, E Brunskill ICML 2019, Proceedings of the 36th International Conference on Machine …, 2019	26*	2019
Parametric adversarial divergences are good task losses for generative modeling G Huang, H Berard, A Touati, G Gidel, P Vincent, S Lacoste-Julien MAIS18, Montreal AI Symposium 2018, 2017	20*	2017
Does Zero-Shot Reinforcement Learning Exist? A Touati, J Rapin, Y Ollivier ICLR 2023, 2022	18	2022
Stable Policy Optimization via Off-Policy Divergence Regularization A Touati, A Zhang, J Pineau, P Vincent UAI2020: Conference on Uncertainty in Artificial Intelligence 2020, 2020	16	2020
Zooming for efficient model-free reinforcement learning in metric spaces A Touati, AA Taiga, MG Bellemare arXiv preprint arXiv:2003.04069, 2020	16	2020
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau Proceedings of the 20th International Conference on Autonomous Agents and …, 2021	10*	2021
SVRG for policy evaluation with fewer gradient evaluations Z Peng, A Touati, P Vincent, D Precup IJCAI2020: the 29th International Joint Conference on Artificial Intelligence, 2019	10	2019
Maximum reward formulation in reinforcement learning SK Gottipati, Y Pathak, R Nuttall, R Chunduru, A Touati, SG Subramanian, ... arXiv preprint arXiv:2010.03744, 2020	9*	2020
Stochastic Neural Network with Kronecker Flow CW Huang, A Touati, P Vincent, GK Dziugaite, A Lacoste, A Courville AISTATS 2020 - Proceedings of the 23nd International Conference on …, 2019	8	2019
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees A Tirinzoni, M Papini, A Touati, A Lazaric, M Pirotta NeurIPS 2022, 2022	4	2022
Adaptive Stochastic Dual Coordinate Ascent for Conditional Random Fields RL Priol, A Touati, S Lacoste-Julien OPTML 2017: 10th NIPS Workshop on Optimization for Machine Learning (NIPS 2017), 2017	2	2017
Large state spaces and self-supervision in reinforcement learning A Touati	1	2022
Sharp Analysis of Smoothed Bellman Error Embedding A Touati, P Vincent ICML 2020 Workshop on Theoretical Foundations of Reinforcement Learning, 2020	1	2020

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs