Zafarali Ahmed

Cited by

	All	Since 2019
Citations	1074	1071
h-index	11	11
i10-index	12	12

Citations per year: 2018: 3, 2019: 53, 2020: 96, 2021: 107, 2022: 175, 2023: 136, 2024: 503

Public access

3 articles available, 0 articles not available (based on funding mandates)

Co-authors

Doina Precup - DeepMind and McGill University - Verified email at cs.mcgill.ca
Khimya Khetarpal - Google Deepmind, Mila - Verified email at google.com
Nicolas Le Roux - Microsoft Research, McGill, UdeM - Verified email at le-roux.name
Simon Gravel - Associate Professor, McGill University - Verified email at mcGill.ca

Zafarali Ahmed

Google DeepMind

Verified email at google.com - Homepage

Machine Learning, Reinforcement Learning, Computational Biology


Title	Cited by	Year
Gemini: a family of highly capable multimodal models. G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	443	2023
Understanding the impact of entropy on policy optimization. Z Ahmed, N Le Roux, M Norouzi, D Schuurmans. International Conference on Machine Learning (ICML) 2019, 151-160, 2019	218	2019
InfoBot: Transfer and Exploration via the Information Bottleneck. A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ... International Conference on Learning Representations (ICLR) 2019, 2019	161	2019
What can I do here? A Theory of Affordances in Reinforcement Learning. K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup. International Conference on Machine Learning (ICML) 2020, 5479-5488, 2020	63	2020
Androidenv: A reinforcement learning platform for android. D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021	42	2021
Gemma: Open models based on gemini research and technology. G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	28	2024
Learning to prove from synthetic theorems. E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ... arXiv preprint arXiv:2006.11259, 2020	21	2020
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms. K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau. 2nd Reproducibility in Machine Learning Workshop at ICML 2018, 2018	20	2018
Intratumor Heterogeneity and Circulating Tumor Cell Clusters. Z Ahmed, S Gravel. Molecular Biology and Evolution, 2017	20	2017
Marginalized state distribution entropy regularization in policy optimization. R Islam, Z Ahmed, D Precup. arXiv preprint arXiv:1912.05128, 2019	15	2019
Training a first-order theorem prover from synthetic data. V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ... arXiv preprint arXiv:2103.03798, 2021	13	2021
Temporally abstract partial models. K Khetarpal, Z Ahmed, G Comanici, D Precup. Advances in Neural Information Processing Systems 34, 1979-1991, 2021	11	2021
Vfunc: a deep generative model for functions. P Bachman, R Islam, A Sordoni, Z Ahmed. Workshop on Prediction and Generative Modeling in Reinforcement Learning at …, 2018	8	2018
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	6	2024
Generalized policy updates for policy optimization. S Kumar, R Dadashi, Z Ahmed, D Schuurmans, MG Bellemare. NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019	2	2019
Discrete off-policy policy gradient using continuous relaxations. A Cianflone, Z Ahmed, R Islam, AJ Bose, WL Hamilton. Unpublished. https://joeybose.github.io/assets/Gradient_estimator.pdf, 2019	2	2019
Learning proposals for sequential importance samplers using reinforced variational inference. Z Ahmed, A Karuvally, D Precup, S Gravel. Deep RL Meets Structured Prediction Workshop at ICLR, 2019	1	2019
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning. G Comanici, A Glaese, A Gergely, D Toyama, Z Ahmed, T Jackson, ... arXiv preprint arXiv:2204.10374, 2022		2022
Unifying Variational Inference and Policy Optimization. Z Ahmed. McGill University, 2019		2019
