Junhyuk Oh
Research Scientist, DeepMind
Verified email at google.com - Homepage
Title
Cited by
Year
Grandmaster level in StarCraft II using multi-agent reinforcement learning
O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ...
Nature 575 (7782), 350-354, 2019
Cited by: 689*; Year: 2019
Action-conditional video prediction using deep networks in atari games
J Oh, X Guo, H Lee, RL Lewis, S Singh
Advances in neural information processing systems, 2863-2871, 2015
Cited by: 616; Year: 2015
Control of memory, active perception, and action in minecraft
J Oh, V Chockalingam, S Singh, H Lee
arXiv preprint arXiv:1605.09128, 2016
Cited by: 190; Year: 2016
Value prediction network
J Oh, S Singh, H Lee
Advances in Neural Information Processing Systems, 6118-6128, 2017
Cited by: 172; Year: 2017
Zero-shot task generalization with multi-task deep reinforcement learning
J Oh, S Singh, H Lee, P Kohli
arXiv preprint arXiv:1706.05064, 2017
Cited by: 146; Year: 2017
Learning transferrable knowledge for semantic segmentation with deep convolutional neural network
S Hong, J Oh, H Lee, B Han
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
Cited by: 136; Year: 2016
Self-imitation learning
J Oh, Y Guo, S Singh, H Lee
arXiv preprint arXiv:1806.05635, 2018
Cited by: 88; Year: 2018
On learning intrinsic rewards for policy gradient methods
Z Zheng, J Oh, S Singh
Advances in Neural Information Processing Systems, 4644-4654, 2018
Cited by: 52; Year: 2018
Many-goals reinforcement learning
V Veeriah, J Oh, S Singh
arXiv preprint arXiv:1806.09605, 2018
Cited by: 34; Year: 2018
Contingency-aware exploration in reinforcement learning
J Choi, Y Guo, M Moczulski, J Oh, N Wu, M Norouzi, H Lee
arXiv preprint arXiv:1811.01483, 2018
Cited by: 28; Year: 2018
Unicorn: Continual learning with a universal, off-policy agent
DJ Mankowitz, A Žídek, A Barreto, D Horgan, M Hessel, J Quan, J Oh, ...
arXiv preprint arXiv:1802.08294, 2018
Cited by: 23; Year: 2018
Hierarchical reinforcement learning for zero-shot generalization with subtask dependencies
S Sohn, J Oh, H Lee
Advances in Neural Information Processing Systems, 7156-7166, 2018
Cited by: 23*; Year: 2018
Discovery of useful questions as auxiliary tasks
V Veeriah, M Hessel, Z Xu, J Rajendran, RL Lewis, J Oh, HP van Hasselt, ...
Advances in Neural Information Processing Systems, 9310-9321, 2019
Cited by: 18; Year: 2019
Generative adversarial self-imitation learning
Y Guo, J Oh, S Singh, H Lee
arXiv preprint arXiv:1812.00950, 2018
Cited by: 15; Year: 2018
Discovering reinforcement learning algorithms
J Oh, M Hessel, WM Czarnecki, Z Xu, HP van Hasselt, S Singh, D Silver
Advances in Neural Information Processing Systems 33, 2020
Cited by: 7; Year: 2020
Self-Tuning Deep Reinforcement Learning
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, H van Hasselt, D Silver, ...
arXiv preprint arXiv:2002.12928, 2020
Cited by: 6; Year: 2020
What Can Learned Intrinsic Rewards Capture?
Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H van Hasselt, D Silver, S Singh
arXiv preprint arXiv:1912.05500, 2019
Cited by: 5; Year: 2019
A self-tuning actor-critic algorithm
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...
Advances in Neural Information Processing Systems 33, 2020
Cited by: 2; Year: 2020
Meta-gradient reinforcement learning with an objective discovered online
Z Xu, HP van Hasselt, M Hessel, J Oh, S Singh, D Silver
Advances in Neural Information Processing Systems 33, 2020
Cited by: 1; Year: 2020
Balancing Constraints and Rewards with Meta-Gradient D4PG
DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann
arXiv preprint arXiv:2010.06324, 2020
Year: 2020
Articles 1–20