Julian Schrittwieser
DeepMind
Verified email at furidamu.org
Title
Cited by
Year
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
Nature 529 (7587), 484-489, 2016
Cited by: 12951, Year: 2016
Mastering the game of Go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
Nature 550 (7676), 354-359, 2017
Cited by: 7244, Year: 2017
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
Cited by: 2278, Year: 2018
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
Cited by: 1347, Year: 2017
Mastering Atari, Go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
Cited by: 843, Year: 2020
StarCraft II: A new challenge for reinforcement learning
O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ...
arXiv preprint arXiv:1708.04782, 2017
Cited by: 700, Year: 2017
DeepMind Lab
C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ...
arXiv preprint arXiv:1612.03801, 2016
Cited by: 393, Year: 2016
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
Nature 529 (7587), 484-489, 2016
Cited by: 111, Year: 2016
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
Cited by: 102, Year: 2019
Bayesian optimization in AlphaGo
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
Cited by: 87, Year: 2018
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
Nature 529 (7587), 484-489, 2016
Cited by: 51, Year: 2016
Online and offline reinforcement learning by planning with a learned model
J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ...
Advances in Neural Information Processing Systems 34, 2021
Cited by: 21, Year: 2021
Panneershelvam Veda
S David, H Aja, J Maddison Chris, G Arthur, S Laurent, ...
Lanctot Marc, Dieleman Sander, Grewe Dominik, Nham John, Kalchbrenner Nal …, 2016
Cited by: 16, Year: 2016
Learning and planning in complex action spaces
T Hubert, J Schrittwieser, I Antonoglou, M Barekatain, S Schmitt, D Silver
International Conference on Machine Learning, 4476-4486, 2021
Cited by: 12, Year: 2021
Local search for policy iteration in continuous control
JT Springenberg, N Heess, D Mankowitz, J Merel, A Byravan, ...
arXiv preprint arXiv:2010.05545, 2020
Cited by: 11, Year: 2020
Competition-level code generation with AlphaCode
Y Li, D Choi, J Chung, N Kushman, J Schrittwieser, R Leblond, T Eccles, ...
arXiv preprint arXiv:2203.07814, 2022
Cited by: 9, Year: 2022
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
Cited by: 8, Year: 2017
Approximate exploitability: Learning a best response in large games
F Timbers, E Lockhart, M Lanctot, M Schmid, J Schrittwieser, T Hubert, ...
arXiv preprint arXiv:2004.09677, 2020
Cited by: 6, Year: 2020
Procedural Generalization by Planning with Self-Supervised World Models
A Anand, J Walker, Y Li, E Vértes, J Schrittwieser, S Ozair, T Weber, ...
arXiv preprint arXiv:2111.01587, 2021
Cited by: 5, Year: 2021
MuZero with Self-competition for Rate Control in VP9 Video Compression
A Mandhane, A Zhernov, M Rauh, C Gu, M Wang, F Xue, W Shang, ...
arXiv preprint arXiv:2202.06626, 2022
Cited by: 1, Year: 2022