Miljan Martic
Miljan Martic
DeepMind
Adresse e-mail validée de google.com
Titre
Citée par
Citée par
Année
Deep reinforcement learning from human preferences
P Christiano, J Leike, TB Brown, M Martic, S Legg, D Amodei
arXiv preprint arXiv:1706.03741, 2017
4182017
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
1712017
Scalable agent alignment via reward modeling: a research direction
J Leike, D Krueger, T Everitt, M Martic, V Maini, S Legg
arXiv preprint arXiv:1811.07871, 2018
542018
Penalizing side effects using stepwise relative reachability
V Krakovna, L Orseau, R Kumar, M Martic, S Legg
arXiv preprint arXiv:1806.01186, 2018
212018
Measuring and avoiding side effects using relative reachability
V Krakovna, L Orseau, M Martic, S Legg
arXiv preprint arXiv:1806.01186, 2018
142018
Deep reinforcement learning from human preferences, 2017
P Christiano, J Leike, TB Brown, M Martic, S Legg, D Amodei
arXiv preprint arXiv:1706.03741, 0
7
Avoiding Side Effects By Considering Future Tasks
V Krakovna, L Orseau, R Ngo, M Martic, S Legg
arXiv preprint arXiv:2010.07877, 2020
32020
Scaling shared model governance via model splitting
M Martic, J Leike, A Trask, M Hessel, S Legg, P Kohli
arXiv preprint arXiv:1812.05979, 2018
22018
Algorithms for Causal Reasoning in Probability Trees
T Genewein, T McGrath, G Déletang, V Mikulik, M Martic, S Legg, ...
arXiv preprint arXiv:2010.12237, 2020
12020
Causal Analysis of Agent Behavior for AI Safety
G Déletang, J Grau-Moya, M Martic, T Genewein, T McGrath, V Mikulik, ...
arXiv preprint arXiv:2103.03938, 2021
2021
Meta-trained agents implement Bayes-optimal agents
V Mikulik, G Delétang, T McGrath, T Genewein, M Martic, S Legg, ...
arXiv preprint arXiv:2010.11223, 2020
2020
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–11