Zafarali Ahmed
Zafarali Ahmed
DeepMind
Adresse e-mail validée de google.com - Page d'accueil
Titre
Citée par
Citée par
Année
Understanding the impact of entropy on policy optimization
Z Ahmed, N Le Roux, M Norouzi, D Schuurmans
International Conference on Machine Learning (ICML) 2019, 151-160, 2019
782019
InfoBot: Transfer and Exploration via the Information Bottleneck
A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ...
International Conference on Learning Representations (ICLR) 2019, 2019
732019
Intratumor Heterogeneity and Circulating Tumor Cell Clusters
Z Ahmed, S Gravel
Molecular Biology and Evolution, 2017
122017
What can I do here? A Theory of Affordances in Reinforcement Learning
K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup
International Conference on Machine Learning (ICML) 2020, 5479--5488, 2020
112020
Vfunc: a deep generative model for functions
P Bachman, R Islam, A Sordoni, Z Ahmed
Workshop on Prediction and Generative Modeling in Reinforcement Learning at …, 2018
72018
Learning to prove from synthetic theorems
E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ...
arXiv preprint arXiv:2006.11259, 2020
62020
Marginalized state distribution entropy regularization in policy optimization
R Islam, Z Ahmed, D Precup
arXiv preprint arXiv:1912.05128, 2019
52019
RE-EVALUATE: Reproducibility in Evaluating Reinforcement Learning Algorithms
K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau
2nd Reproducibility in Machine Learning Workshop at ICML 2018, 2018
52018
Learning proposals for sequential importance samplers using reinforced variational inference
Z Ahmed, A Karuvally, D Precup, S Gravel
Deep RL Meets Structured Prediction Workshop at ICLR, 2019
12019
AndroidEnv: A Reinforcement Learning Platform for Android
D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ...
arXiv preprint arXiv:2105.13231, 2021
2021
Training a First-Order Theorem Prover from Synthetic Data
V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ...
arXiv preprint arXiv:2103.03798, 2021
2021
Unifying Variational Inference and Policy Optimization
Z Ahmed
McGill University, 2019
2019
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–12