Abbas Abdolmaleki

Cited by

	All	Since 2019
Citations	4048	3797
h-index	27	24
i10-index	44	36

1200

600

300

900

2014201520162017201820192020202120222023202414 28 29 62 81 177 377 536 890 1104 698

Public access

View all

12 articles

1 article

available

not available

Based on funding mandates

Co-authors

Martin RiedmillerDeepMindVerified email at google.com
Nicolas HeessDeepMindVerified email at google.com
Michael NeunertGoogle DeepMindVerified email at google.com
Luis Paulo ReisAssociate Professor, University of PortoVerified email at fe.up.pt
Nuno LauUniversidade de AveiroVerified email at ua.pt
Thomas LampeDeepMindVerified email at google.com
Yuval TassaSenior Research Scientist, Google DeepMindVerified email at google.com
Roland HafnerDeepMindVerified email at google.com
Gerhard NeumannProfessor, Karlsruhe Institute of Technology (KIT)Verified email at robot-learning.de
Noah Y. SiegelGoogle DeepMindVerified email at google.com
Josh MerelVerified email at google.com
Steven BohezGoogle DeepMindVerified email at google.com
Nima ShafiiNVIDIAVerified email at nvidia.com
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIVerified email at ias.tu-darmstadt.de
Rudolf LioutikovTT-Professor, Intuitive Robots Lab, Karlsruhe Institute of TechnologyVerified email at kit.edu
Jost Tobias SpringenbergGoogle DeepMind

Abbas Abdolmaleki

Deepmind

Verified email at google.com

Artificial Intelligence Reinforcement Learning Robotics


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Magnetic control of tokamak plasmas through deep reinforcement learning J Degrave, F Felici, J Buchli, M Neunert, B Tracey, F Carpanese, T Ewalds, ... Nature 602 (7897), 414-419, 2022	707	2022
Deepmind control suite Y Tassa, Y Doron, A Muldal, T Erez, Y Li, DL Casas, D Budden, ... arXiv preprint arXiv:1801.00690, 2018	586	2018
Maximum a posteriori policy optimisation A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ... arXiv preprint arXiv:1806.06920, 2018	505	2018
Keep doing what worked: Behavioral modelling priors for offline reinforcement learning NY Siegel, JT Springenberg, F Berkenkamp, A Abdolmaleki, M Neunert, ... arXiv preprint arXiv:2002.08396, 2020	291	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	239	2020
Robust reinforcement learning for continuous control with model misspecification DJ Mankowitz, N Levine, R Jeong, Y Shi, J Kay, A Abdolmaleki, ... arXiv preprint arXiv:1906.07516, 2019	122	2019
From motor control to team play in simulated humanoid football S Liu, G Lever, Z Wang, J Merel, SMA Eslami, D Hennes, WM Czarnecki, ... Science Robotics 7 (69), eabo0235, 2022	112	2022
V-mpo: On-policy maximum a posteriori policy optimization for discrete and continuous control HF Song, A Abdolmaleki, JT Springenberg, A Clark, H Soyer, JW Rae, ... arXiv preprint arXiv:1909.12238, 2019	112	2019
Model-based relative entropy stochastic search A Abdolmaleki, R Lioutikov, JR Peters, N Lau, L Pualo Reis, G Neumann Advances in Neural Information Processing Systems 28, 2015	93	2015
Continuous-discrete reinforcement learning for hybrid control in robotics M Neunert, A Abdolmaleki, M Wulfmeier, T Lampe, T Springenberg, ... Conference on Robot Learning, 735-751, 2020	90	2020
Beyond pick-and-place: Tackling robotic stacking of diverse shapes AX Lee, CM Devin, Y Zhou, T Lampe, K Bousmalis, JT Springenberg, ... 5th Annual Conference on Robot Learning, 2021	83	2021
A distributional view on multi-objective policy optimization A Abdolmaleki, S Huang, L Hasenclever, M Neunert, F Song, M Zambelli, ... International conference on machine learning, 11-22, 2020	77	2020
Value constrained model-free continuous control S Bohez, A Abdolmaleki, M Neunert, J Buchli, N Heess, R Hadsell arXiv preprint arXiv:1902.04623, 2019	70	2019
Relative entropy regularized policy iteration A Abdolmaleki, JT Springenberg, J Degrave, S Bohez, Y Tassa, D Belov, ... arXiv preprint arXiv:1812.02256, 2018	70	2018
Robocat: A self-improving foundation agent for robotic manipulation K Bousmalis, G Vezzani, D Rao, C Devin, AX Lee, M Bauza, T Davchev, ... arXiv preprint arXiv:2306.11706, 2023	55	2023
Model-free trajectory optimization for reinforcement learning R Akrour, G Neumann, H Abdulsamad, A Abdolmaleki International Conference on Machine Learning, 2961-2970, 2016	49	2016
Data-efficient hindsight off-policy option learning M Wulfmeier, D Rao, R Hafner, T Lampe, A Abdolmaleki, T Hertweck, ... International Conference on Machine Learning, 11340-11350, 2021	46	2021
Imagined value gradients: Model-based policy optimization with tranferable latent dynamics models A Byravan, JT Springenberg, A Abdolmaleki, R Hafner, M Neunert, ... Conference on Robot Learning, 566-589, 2020	43	2020
An optimized gait generator based on fourier series towards fast and robust biped locomotion involving arms swing N Shafii, A Khorsandian, A Abdolmaleki, B Jozi 2009 IEEE International Conference on Automation and Logistics, 2018-2023, 2009	41	2009
Deriving and improving cma-es with information geometric trust regions A Abdolmaleki, B Price, N Lau, LP Reis, G Neumann Proceedings of the Genetic and Evolutionary Computation Conference, 657-664, 2017	40	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors